_
RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome
for updates on publications, follow: on Instagram, Twitter, Patreon, YouTube, Kaggle metadata


_

You are now here: AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13 > (tag cloud) >tag_selected: heuristic


Currently searching for:

if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long


if you modify the keywords, press enter within the field to confirm the new search key

Tag: heuristic

Bibliography items where occurs: 119
A multilevel framework for AI governance / 2307.03198 / ISBN:https://doi.org/10.48550/arXiv.2307.03198 / Published by ArXiv / Version released on 2023-07-13 / on (web) Publishing site


Regulating AI manipulation: Applying Insights from behavioral economics and psychology to enhance the practicality of the EU AI Act / 2308.02041 / ISBN:https://doi.org/10.48550/arXiv.2308.02041 / Published by ArXiv / Version released on 2023-07-24 / on (web) Publishing site


Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph / 2308.13534 / ISBN:https://doi.org/10.48550/arXiv.2308.13534 / Published by ArXiv / Version released on 2023-08-13 / on (web) Publishing site


The AI Revolution: Opportunities and Challenges for the Finance Sector / 2308.16538 / ISBN:https://doi.org/10.48550/arXiv.2308.16538 / Published by ArXiv / Version released on 2023-08-31 / on (web) Publishing site


A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics / 2310.05694 / ISBN:https://doi.org/10.48550/arXiv.2310.05694 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site


Compromise in Multilateral Negotiations and the Global Regulation of Artificial Intelligence / 2309.17158 / ISBN:https://doi.org/10.48550/arXiv.2309.17158 / Published by ArXiv / Version released on 2023-09-29 / on (web) Publishing site


Toward an Ethics of AI Belief / 2304.14577 / ISBN:https://doi.org/10.48550/arXiv.2304.14577 / Published by ArXiv / Version released on 2024-04-13 / on (web) Publishing site


Responsible AI Pattern Catalogue: A Collection of Best Practices for AI Governance and Engineering / 2209.04963 / ISBN:https://doi.org/10.48550/arXiv.2209.04963 / Published by ArXiv / Version released on 2023-09-28 / on (web) Publishing site


LLMs grasp morality in concept / 2311.02294 / ISBN:https://doi.org/10.48550/arXiv.2311.02294 / Published by ArXiv / Version released on 2023-11-04 / on (web) Publishing site


Towards Effective Paraphrasing for Information Disguise / 2311.05018 / ISBN:https://doi.org/10.1007/978-3-031-28238-6_22 / Published by ArXiv / Version released on 2023-11-08 / on (web) Publishing site


Synergizing Human-AI Agency: A Guide of 23 Heuristics for Service Co-Creation with LLM-Based Agents / 2310.15065 / ISBN:https://doi.org/10.48550/arXiv.2310.15065 / Published by ArXiv / Version released on 2023-11-29 / on (web) Publishing site


GPT in Data Science: A Practical Exploration of Model Selection / 2311.11516 / ISBN:https://doi.org/10.48550/arXiv.2311.11516 / Published by ArXiv / Version released on 2023-11-20 / on (web) Publishing site


Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site


Ethical Considerations Towards Protestware / 2306.10019 / ISBN:https://doi.org/10.48550/arXiv.2306.10019 / Published by ArXiv / Version released on 2024-01-05 / on (web) Publishing site


Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning / 2312.17479 / ISBN:https://doi.org/10.48550/arXiv.2312.17479 / Published by ArXiv / Version released on 2023-12-29 / on (web) Publishing site


Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review / 2401.01519 / ISBN:https://doi.org/10.48550/arXiv.2401.01519 / Published by ArXiv / Version released on 2025-04-20 / on (web) Publishing site


A Scoping Study of Evaluation Practices for Responsible AI Tools: Steps Towards Effectiveness Evaluations / 2401.17486 / ISBN:https://doi.org/10.48550/arXiv.2401.17486 / Published by ArXiv / Version released on 2024-01-30 / on (web) Publishing site


Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's CubeĆ  / 2402.01760 / ISBN:https://doi.org/10.48550/arXiv.2402.01760 / Published by ArXiv / Version released on 2024-08-27 / on (web) Publishing site


How do machines learn? Evaluating the AIcon2abs method / 2401.07386 / ISBN:https://doi.org/10.48550/arXiv.2401.07386 / Published by ArXiv / Version released on 2024-07-24 / on (web) Publishing site


User Modeling and User Profiling: A Comprehensive Survey / 2402.09660 / ISBN:https://doi.org/10.48550/arXiv.2402.09660 / Published by ArXiv / Version released on 2024-02-20 / on (web) Publishing site


Towards an AI-Enhanced Cyber Threat Intelligence Processing Pipeline / 2403.03265 / ISBN:https://doi.org/10.48550/arXiv.2403.03265 / Published by ArXiv / Version released on 2024-03-05 / on (web) Publishing site


A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site


Responsible Artificial Intelligence: A Structured Literature Review / 2403.06910 / ISBN:https://doi.org/10.48550/arXiv.2403.06910 / Published by ArXiv / Version released on 2024-03-11 / on (web) Publishing site


Power and Play Investigating License to Critique in Teams AI Ethics Discussions / 2403.19049 / ISBN:https://doi.org/10.48550/arXiv.2403.19049 / Published by ArXiv / Version released on 2024-04-08 / on (web) Publishing site


Designing for Human-Agent Alignment: Understanding what humans want from their agents / 2404.04289 / ISBN:https://doi.org/10.1145/3613905.3650948 / Published by ArXiv / Version released on 2024-04-04 / on (web) Publishing site


Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents / 2404.06750 / ISBN:https://arxiv.org/abs/2404.06750 / Published by ArXiv / Version released on 2024-10-18 / on (web) Publishing site


A Critical Survey on Fairness Benefits of Explainable AI / 2310.13007 / ISBN:https://doi.org/10.1145/3630106.3658990 / Published by ArXiv / Version released on 2024-05-07 / on (web) Publishing site


AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site


Characterizing and modeling harms from interactions with design patterns in AI interfaces / 2404.11370 / ISBN:https://doi.org/10.48550/arXiv.2404.11370 / Published by ArXiv / Version released on 2024-05-20 / on (web) Publishing site


Taxonomy to Regulation: A (Geo)Political Taxonomy for AI Risks and Regulatory Measures in the EU AI Act / 2404.11476 / ISBN:https://doi.org/10.48550/arXiv.2404.11476 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site


Modeling Emotions and Ethics with Large Language Models / 2404.13071 / ISBN:https://doi.org/10.48550/arXiv.2404.13071 / Published by ArXiv / Version released on 2024-06-25 / on (web) Publishing site


Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models / 2405.07076 / ISBN:https://doi.org/10.48550/arXiv.2405.07076 / Published by ArXiv / Version released on 2024-05-14 / on (web) Publishing site


A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions / 2405.14487 / ISBN:https://doi.org/10.48550/arXiv.2405.14487 / Published by ArXiv / Version released on 2024-05-23 / on (web) Publishing site


Responsible AI for Earth Observation / 2405.20868 / ISBN:https://doi.org/10.48550/arXiv.2405.20868 / Published by ArXiv / Version released on 2024-05-31 / on (web) Publishing site


Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective / 2406.05724 / ISBN:https://doi.org/10.48550/arXiv.2406.05724 / Published by ArXiv / Version released on 2024-06-11 / on (web) Publishing site


Documenting Ethical Considerations in Open Source AI Models / 2406.18071 / ISBN:https://doi.org/10.48550/arXiv.2406.18071 / Published by ArXiv / Version released on 2024-07-03 / on (web) Publishing site


Nudging Using Autonomous Agents: Risks and Ethical Considerations / 2407.16362 / ISBN:https://doi.org/10.48550/arXiv.2407.16362 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site


Interactive embodied evolution for socially adept Artificial General Creatures / 2407.21357 / ISBN:https://doi.org/10.48550/arXiv.2407.21357 / Published by ArXiv / Version released on 2024-07-31 / on (web) Publishing site


Surveys Considered Harmful? Reflecting on the Use of Surveys in AI Research, Development, and Governance / 2408.01458 / ISBN:https://doi.org/10.48550/arXiv.2408.01458 / Published by ArXiv / Version released on 2024-07-26 / on (web) Publishing site


The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site


Neuro-Symbolic AI for Military Applications / 2408.09224 / ISBN:https://doi.org/10.48550/arXiv.2408.09224 / Published by ArXiv / Version released on 2024-08-24 / on (web) Publishing site


Catalog of General Ethical Requirements for AI Certification / 2408.12289 / ISBN:https://doi.org/10.48550/arXiv.2408.12289 / Published by ArXiv / Version released on 2024-11-15 / on (web) Publishing site


Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey / 2408.12880 / ISBN:https://doi.org/10.48550/arXiv.2408.12880 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site


Aligning XAI with EU Regulations for Smart Biomedical Devices: A Methodology for Compliance Analysis / 2408.15121 / ISBN:https://doi.org/10.48550/arXiv.2408.15121 / Published by ArXiv / Version released on 2024-08-27 / on (web) Publishing site


Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems / 2408.15550 / ISBN:https://doi.org/10.48550/arXiv.2408.15550 / Published by ArXiv / Version released on 2024-09-02 / on (web) Publishing site


How Mature is Requirements Engineering for AI-based Systems? A Systematic Mapping Study on Practices, Challenges, and Future Research Directions / 2409.07192 / ISBN:https://doi.org/10.48550/arXiv.2409.07192 / Published by ArXiv / Version released on 2024-09-11 / on (web) Publishing site


Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site


Study on the Helpfulness of Explainable Artificial Intelligence / 2410.11896 / ISBN:https://doi.org/10.48550/arXiv.2410.11896 / Published by ArXiv / Version released on 2024-10-14 / on (web) Publishing site


Standardization Trends on Safety and Trustworthiness Technology for Advanced AI / 2410.22151 / ISBN:https://doi.org/10.48550/arXiv.2410.22151 / Published by ArXiv / Version released on 2024-10-29 / on (web) Publishing site


I Always Felt that Something Was Wrong.: Understanding Compliance Risks and Mitigation Strategies when Professionals Use Large Language Models / 2411.04576 / ISBN:https://doi.org/10.48550/arXiv.2411.04576 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site


Good intentions, unintended consequences: exploring forecasting harms / 2411.16531 / ISBN:https://doi.org/10.48550/arXiv.2411.16531 / Published by ArXiv / Version released on 2025-03-12 / on (web) Publishing site


AI Ethics in Smart Homes: Progress, User Requirements and Challenges / 2412.09813 / ISBN:https://doi.org/10.48550/arXiv.2412.09813 / Published by ArXiv / Version released on 2024-12-13 / on (web) Publishing site


Bots against Bias: Critical Next Steps for Human-Robot Interaction / 2412.12542 / ISBN:https://doi.org/10.1017/9781009386708.023 / Published by ArXiv / Version released on 2024-12-17 / on (web) Publishing site


Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site


Towards A Litmus Test for Common Sense / 2501.09913 / ISBN:https://doi.org/10.48550/arXiv.2501.09913 / Published by ArXiv / Version released on 2025-01-17 / on (web) Publishing site


A Conceptual Exploration of Generative AI-Induced Cognitive Dissonance and its Emergence in University-Level Academic Writing / 2502.05698 / ISBN:https://doi.org/10.48550/arXiv.2502.05698 / Published by ArXiv / Version released on 2025-02-08 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


Can AI Model the Complexities of Human Moral Decision-Making? A Qualitative Study of Kidney Allocation Decisions / 2503.00940 / ISBN:https://doi.org/10.48550/arXiv.2503.00940 / Published by ArXiv / Version released on 2025-03-02 / on (web) Publishing site


Medical Hallucinations in Foundation Models and Their Impact on Healthcare / 2503.05777 / ISBN:https://doi.org/10.48550/arXiv.2503.05777 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site


Generative AI in Transportation Planning: A Survey / 2503.07158 / ISBN:https://doi.org/10.48550/arXiv.2503.07158 / Published by ArXiv / Version released on 2025-05-07 / on (web) Publishing site


Synthetic Data for Robust AI Model Development in Regulated Enterprises / 2503.12353 / ISBN:https://doi.org/10.48550/arXiv.2503.12353 / Published by ArXiv / Version released on 2025-03-16 / on (web) Publishing site


Advancing Human-Machine Teaming: Concepts, Challenges, and Applications / 2503.16518 / ISBN:https://doi.org/10.48550/arXiv.2503.16518 / Published by ArXiv / Version released on 2025-05-06 / on (web) Publishing site


Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation / 2502.05151 / ISBN:https://doi.org/10.48550/arXiv.2502.05151 / Published by ArXiv / Version released on 2025-04-16 / on (web) Publishing site


Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions / 2504.09343 / ISBN:https://doi.org/10.48550/arXiv.2504.09343 / Published by ArXiv / Version released on 2025-04-12 / on (web) Publishing site


Framework, Standards, Applications and Best practices of Responsible AI : A Comprehensive Survey / 2504.13979 / ISBN:https://doi.org/10.48550/arXiv.2504.13979 / Published by ArXiv / Version released on 2025-04-18 / on (web) Publishing site


TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models / 2504.20605 / ISBN:https://doi.org/10.48550/arXiv.2504.20605 / Published by ArXiv / Version released on 2025-04-29 / on (web) Publishing site


From Texts to Shields: Convergence of Large Language Models and Cybersecurity / 2505.00841 / ISBN:https://doi.org/10.48550/arXiv.2505.00841 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site


Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs / 2505.02009 / ISBN:https://doi.org/10.48550/arXiv.2505.02009 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site


WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models / 2505.09595 / ISBN:https://doi.org/10.48550/arXiv.2505.09595 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site


Formalising Human-in-the-Loop: Computational Reductions, Failure Modes, and Legal-Moral Responsibility / 2505.10426 / ISBN:https://doi.org/10.48550/arXiv.2505.10426 / Published by ArXiv / Version released on 2025-09-25 / on (web) Publishing site


Beyond Individual UX: Defining Group Experience(GX) as a New Paradigm for Group-centered AI / 2505.12780 / ISBN:https://doi.org/10.48550/arXiv.2505.12780 / Published by ArXiv / Version released on 2025-05-19 / on (web) Publishing site


Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions / 2505.20692 / ISBN:https://doi.org/10.48550/arXiv.2505.20692 / Published by ArXiv / Version released on 2025-05-27 / on (web) Publishing site


Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


Locating Risk: Task Designers and the Challenge of Risk Disclosure in RAI Content Work / 2505.24246 / ISBN:https://doi.org/10.48550/arXiv.2505.24246 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


Bottom-Up Perspectives on AI Governance: Insights from User Reviews of AI Products / 2506.00080 / ISBN:https://doi.org/10.48550/arXiv.2506.00080 / Published by ArXiv / Version released on 2025-05-30 / on (web) Publishing site


Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety / 2506.00415 / ISBN:https://doi.org/10.48550/arXiv.2506.00415 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site


Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment / 2506.02046 / ISBN:https://doi.org/10.48550/arXiv.2506.02046 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site


Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe? / 2506.11945 / ISBN:https://doi.org/10.48550/arXiv.2506.11945 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site


A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site


Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs / 2506.13082 / ISBN:https://doi.org/10.48550/arXiv.2506.13082 / Published by ArXiv / Version released on 2025-10-06 / on (web) Publishing site


AI Through the Human Lens: Investigating Cognitive Theories in Machine Psychology / 2506.18156 / ISBN:https://doi.org/10.48550/arXiv.2506.18156 / Published by ArXiv / Version released on 2025-11-07 / on (web) Publishing site


Policy-Driven AI in Dataspaces: Taxonomy, Explainability, and Pathways for Compliant Innovation / 2507.20014 / ISBN:https://doi.org/10.48550/arXiv.2507.20014 / Published by ArXiv / Version released on 2025-07-30 / on (web) Publishing site


The AI Ethical Resonance Hypothesis: The Possibility of Discovering Moral Meta-Patterns in AI Systems / 2507.11552 / ISBN:https://doi.org/10.48550/arXiv.2507.11552 / Published by ArXiv / Version released on 2025-07-13 / on (web) Publishing site


The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site


Understanding the Impact of Physicians' Legal Considerations on XAI Systems / 2507.15996 / ISBN:https://doi.org/10.48550/arXiv.2507.15996 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


Countering Privacy Nihilism / 2507.18253 / ISBN:https://doi.org/10.48550/arXiv.2507.18253 / Published by ArXiv / Version released on 2025-07-24 / on (web) Publishing site


Rethinking Evidence Hierarchies in Medical Language Benchmarks: A Critical Evaluation of HealthBench / 2508.00081 / ISBN:https://doi.org/10.48550/arXiv.2508.00081 / Published by ArXiv / Version released on 2025-07-31 / on (web) Publishing site


Think First, Verify Always: Training Humans to Face AI Risks / 2508.03714 / ISBN:https://doi.org/10.48550/arXiv.2508.03714 / Published by ArXiv / Version released on 2025-07-23 / on (web) Publishing site


A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


Ethical Concerns of Generative AI and Mitigation Strategies: A Systematic Mapping Study / 2502.00015 / ISBN:https://doi.org/10.48550/arXiv.2502.00015 / Published by ArXiv / Version released on 2025-08-22 / on (web) Publishing site


A Moral Agency Framework for Legitimate Integration of AI in Bureaucracies / 2508.08231 / ISBN:https://doi.org/10.48550/arXiv.2508.08231 / Published by ArXiv / Version released on 2025-08-21 / on (web) Publishing site


Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


Artificial Emotion: A Survey of Theories and Debates on Realising Emotion in Artificial Intelligence / 2508.10286 / ISBN:https://doi.org/10.48550/arXiv.2508.10286 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


The User-first Approach to AI Ethics: Preferences for Ethical Principles in AI Systems across Cultures and Contexts / 2508.11327 / ISBN:https://doi.org/10.48550/arXiv.2508.11327 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site


A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond / 2508.11957 / ISBN:https://doi.org/10.48550/arXiv.2508.11957 / Published by ArXiv / Version released on 2025-08-16 / on (web) Publishing site


The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site


The Quasi-Creature and the Uncanny Valley of Agency: A Synthesis of Theory and Evidence on User Interaction with Inconsistent Generative AI / 2508.18563 / ISBN:https://doi.org/10.48550/arXiv.2508.18563 / Published by ArXiv / Version released on 2025-08-25 / on (web) Publishing site


Towards Enhancing Data Equity in Public Health Data Science / 2508.20301 / ISBN:https://doi.org/10.48550/arXiv.2508.20301 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science / 2508.20674 / ISBN:https://doi.org/10.48550/arXiv.2508.20674 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site


Developer Insights into Designing AI-Based Computer Perception Tools / 2508.21733 / ISBN:https://doi.org/10.48550/arXiv.2508.21733 / Published by ArXiv / Version released on 2025-08-29 / on (web) Publishing site


Bridging Human Cognition and AI: A Framework for Explainable Decision-Making Systems / 2509.02388 / ISBN:https://doi.org/10.48550/arXiv.2509.02388 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code / 2509.07006 / ISBN:https://doi.org/10.48550/arXiv.2509.07006 / Published by ArXiv / Version released on 2025-09-06 / on (web) Publishing site


Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures / 2509.08839 / ISBN:https://doi.org/10.48550/arXiv.2509.08839 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site


Understanding the Process of Human-AI Value Alignment / 2509.13854 / ISBN:https://doi.org/10.48550/arXiv.2509.13854 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site


Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site


Perceptions of AI Across Sectors: A Comparative Review of Public Attitudes / 2509.18233 / ISBN:https://doi.org/10.48550/arXiv.2509.18233 / Published by ArXiv / Version released on 2025-09-22 / on (web) Publishing site


Human-aligned AI Model Cards with Weighted Hierarchy Architecture / 2510.06989 / ISBN:https://doi.org/10.48550/arXiv.2510.06989 / Published by ArXiv / Version released on 2025-10-08 / on (web) Publishing site


The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs / 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


A New Digital Divide? Coder Worldviews, the Slop Economy, and Democracy in the Age of AI / 2510.04755 / ISBN:https://doi.org/10.48550/arXiv.2510.04755 / Published by ArXiv / Version released on 2025-10-23 / on (web) Publishing site


On Controlled Change: Generative AI's Impact on Professional Authority in Journalism / 2510.19792 / ISBN:https://doi.org/10.48550/arXiv.2510.19792 / Published by ArXiv / Version released on 2025-10-22 / on (web) Publishing site


Diverse Human Value Alignment for Large Language Models via Ethical Reasoning / 2511.00379 / ISBN:https://doi.org/10.48550/arXiv.2511.00379 / Published by ArXiv / Version released on 2025-11-01 / on (web) Publishing site


When Machines Join the Moral Circle: The Persona Effect of Generative AI Agents in Collaborative Reasoning / 2511.01205 / ISBN:https://doi.org/10.48550/arXiv.2511.01205 / Published by ArXiv / Version released on 2025-11-03 / on (web) Publishing site


People Perceive More Phantom Costs From Autonomous Agents When They Make Unreasonably Generous Offers / 2511.07401 / ISBN:https://doi.org/10.48550/arXiv.2511.07401 / Published by ArXiv / Version released on 2025-11-10 / on (web) Publishing site


BeautyGuard: Designing a Multi-Agent Roundtable System for Proactive Beauty Tech Compliance through Stakeholder Collaboration / 2511.12645 / ISBN:https://doi.org/10.48550/arXiv.2511.12645 / Version released on 2025-11-18 / on (web) Publishing site


Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Version released on 2025-11-21 / on (web) Publishing site


Cross-cultural value alignment frameworks for responsible AI governance: Evidence from China-West comparative analysis / 2511.17256 / ISBN:https://doi.org/10.48550/arXiv.2511.17256 / Version released on 2025-11-21 / on (web) Publishing site


From Prediction to Foresight: The Role of AI in Designing Responsible Futures / 2511.21570 / ISBN:https://doi.org/10.48550/arXiv.2511.21570 / Version released on 2025-11-26 / on (web) Publishing site