if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long
if you modify the keywords, press enter within the field to confirm the new search key
Tag: reward
Bibliography items where occurs: 252
- The AI Index 2022 Annual Report / 2205.03468 / ISBN:https://doi.org/10.48550/arXiv.2205.03468 / Published by ArXiv / Version released on 2022-05-02 / on (web) Publishing site
- ESR: Ethics and Society Review of Artificial Intelligence Research / 2106.11521 / ISBN:https://doi.org/10.48550/arXiv.2106.11521 / Published by ArXiv / Version released on 2021-07-09 / on (web) Publishing site
- On the Current and Emerging Challenges of Developing Fair and Ethical AI Solutions in Financial Services / 2111.01306 / ISBN:https://doi.org/10.48550/arXiv.2111.01306 / Published by ArXiv / Version released on 2021-11-02 / on (web) Publishing site
- From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence / 2308.02448 / ISBN:https://doi.org/10.48550/arXiv.2308.02448 / Published by ArXiv / Version released on 2023-08-04 / on (web) Publishing site
- Normative Ethics Principles for Responsible AI Systems: Taxonomy and Future Directions / 2208.12616 / ISBN:https://doi.org/10.48550/arXiv.2208.12616 / Published by ArXiv / Version released on 2023-10-26 / on (web) Publishing site
- A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation / 2305.11391 / ISBN:https://doi.org/10.48550/arXiv.2305.11391 / Published by ArXiv / Version released on 2023-08-27 / on (web) Publishing site
- Exploring the Power of Creative AI Tools and Game-Based Methodologies for Interactive Web-Based Programming / 2308.11649 / ISBN:https://doi.org/10.48550/arXiv.2308.11649 / Published by ArXiv / Version released on 2023-08-18 / on (web) Publishing site
- Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph / 2308.13534 / ISBN:https://doi.org/10.48550/arXiv.2308.13534 / Published by ArXiv / Version released on 2023-08-13 / on (web) Publishing site
- The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site
- The AI Revolution: Opportunities and Challenges for the Finance Sector / 2308.16538 / ISBN:https://doi.org/10.48550/arXiv.2308.16538 / Published by ArXiv / Version released on 2023-08-31 / on (web) Publishing site
- Pathway to Future Symbiotic Creativity / 2209.02388 / ISBN:https://doi.org/10.48550/arXiv.2209.02388 / Published by ArXiv / Version released on 2023-09-13 / on (web) Publishing site
- EALM: Introducing Multidimensional Ethical Alignment in
Conversational Information Retrieval / 2310.00970 / ISBN:https://doi.org/10.48550/arXiv.2310.00970 / Published by ArXiv / Version released on 2023-10-02 / on (web) Publishing site
- Security Considerations in AI-Robotics: A Survey of Current Methods, Challenges, and Opportunities / 2310.08565 / ISBN:https://doi.org/10.48550/arXiv.2310.08565 / Published by ArXiv / Version released on 2024-01-26 / on (web) Publishing site
- If our aim is to build morality into an artificial agent, how might we begin to go about doing so? / 2310.08295 / ISBN:https://doi.org/10.48550/arXiv.2310.08295 / Published by ArXiv / Version released on 2023-10-12 / on (web) Publishing site
- A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics / 2310.05694 / ISBN:https://doi.org/10.48550/arXiv.2310.05694 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site
- STREAM: Social data and knowledge collective intelligence platform for TRaining Ethical AI Models / 2310.05563 / ISBN:https://doi.org/10.48550/arXiv.2310.05563 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site
- Towards A Unified Utilitarian Ethics Framework for Healthcare Artificial Intelligence / 2309.14617 / ISBN:https://doi.org/10.48550/arXiv.2309.14617 / Published by ArXiv / Version released on 2023-09-26 / on (web) Publishing site
- An Evaluation of GPT-4 on the ETHICS Dataset / 2309.10492 / ISBN:https://doi.org/10.48550/arXiv.2309.10492 / Published by ArXiv / Version released on 2023-09-19 / on (web) Publishing site
- The Glamorisation of Unpaid Labour: AI and its Influencers / 2308.02399 / ISBN:https://doi.org/10.48550/arXiv.2308.02399 / Published by ArXiv / Version released on 2023-09-16 / on (web) Publishing site
- AI & Blockchain as sustainable teaching and learning tools to cope with the 4IR / 2305.01088 / ISBN:https://doi.org/10.48550/arXiv.2305.01088 / Published by ArXiv / Version released on 2023-09-17 / on (web) Publishing site
- Responsible AI Pattern Catalogue: A Collection of Best Practices for AI Governance and Engineering / 2209.04963 / ISBN:https://doi.org/10.48550/arXiv.2209.04963 / Published by ArXiv / Version released on 2023-09-28 / on (web) Publishing site
- Specific versus General Principles for Constitutional AI / 2310.13798 / ISBN:https://doi.org/10.48550/arXiv.2310.13798 / Published by ArXiv / Version released on 2023-10-20 / on (web) Publishing site
- Systematic AI Approach for AGI:
Addressing Alignment, Energy, and AGI Grand Challenges / 2310.15274 / ISBN:https://doi.org/10.48550/arXiv.2310.15274 / Published by ArXiv / Version released on 2023-10-23 / on (web) Publishing site
- AI Alignment and Social Choice: Fundamental
Limitations and Policy Implications / 2310.16048 / ISBN:https://doi.org/10.48550/arXiv.2310.16048 / Published by ArXiv / Version released on 2023-10-24 / on (web) Publishing site
- Unpacking the Ethical Value Alignment in Big Models / 2310.17551 / ISBN:https://doi.org/10.48550/arXiv.2310.17551 / Published by ArXiv / Version released on 2023-10-26 / on (web) Publishing site
- LLMs grasp morality in concept / 2311.02294 / ISBN:https://doi.org/10.48550/arXiv.2311.02294 / Published by ArXiv / Version released on 2023-11-04 / on (web) Publishing site
- Educating for AI Cybersecurity Work and Research: Ethics, Systems Thinking, and
Communication Requirements / 2311.04326 / ISBN:https://doi.org/10.48550/arXiv.2311.04326 / Published by ArXiv / Version released on 2023-11-07 / on (web) Publishing site
- Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing / 2304.02017 / ISBN:https://doi.org/10.48550/arXiv.2304.02017 / Published by ArXiv / Version released on 2024-08-03 / on (web) Publishing site
- A Brief History of Prompt: Leveraging Language Models. (Through Advanced Prompting) / 2310.04438 / ISBN:https://doi.org/10.48550/arXiv.2310.04438 / Published by ArXiv / Version released on 2023-11-28 / on (web) Publishing site
- She had Cobalt Blue Eyes: Prompt Testing to Create Aligned and Sustainable Language Models / 2310.18333 / ISBN:https://doi.org/10.48550/arXiv.2310.18333 / Published by ArXiv / Version released on 2023-12-15 / on (web) Publishing site
- Safety, Trust, and Ethics Considerations for Human-AI Teaming in Aerospace Control / 2311.08943 / ISBN:https://doi.org/10.48550/arXiv.2311.08943 / Published by ArXiv / Version released on 2023-11-15 / on (web) Publishing site
- Case Repositories: Towards Case-Based Reasoning for AI Alignment / 2311.10934 / ISBN:https://doi.org/10.48550/arXiv.2311.10934 / Published by ArXiv / Version released on 2023-11-26 / on (web) Publishing site
- Large Language Models in Education: Vision and Opportunities / 2311.13160 / ISBN:https://doi.org/10.48550/arXiv.2311.13160 / Published by ArXiv / Version released on 2023-11-22 / on (web) Publishing site
- Ethical Implications of ChatGPT in Higher Education: A Scoping Review / 2311.14378 / ISBN:https://doi.org/10.48550/arXiv.2311.14378 / Published by ArXiv / Version released on 2024-06-05 / on (web) Publishing site
- Generative AI and US Intellectual Property Law / 2311.16023 / ISBN:https://doi.org/10.48550/arXiv.2311.16023 / Published by ArXiv / Version released on 2023-11-27 / on (web) Publishing site
- Intelligence Primer / 2008.07324 / ISBN:https://doi.org/10.48550/arXiv.2008.07324 / Published by ArXiv / Version released on 2025-09-03 / on (web) Publishing site
- Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates / 2312.06861 / ISBN:https://doi.org/10.48550/arXiv.2312.06861 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site
- Learning Human-like Representations to Enable Learning Human Values / 2312.14106 / ISBN:https://doi.org/10.48550/arXiv.2312.14106 / Published by ArXiv / Version released on 2024-11-08 / on (web) Publishing site
- Improving Task Instructions for Data Annotators: How Clear Rules and Higher Pay Increase Performance in Data Annotation in the AI Economy / 2312.14565 / ISBN:https://doi.org/10.48550/arXiv.2312.14565 / Published by ArXiv / Version released on 2024-08-16 / on (web) Publishing site
- Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning / 2312.17479 / ISBN:https://doi.org/10.48550/arXiv.2312.17479 / Published by ArXiv / Version released on 2023-12-29 / on (web) Publishing site
- Autonomous Threat Hunting: A Future Paradigm for AI-Driven Threat Intelligence / 2401.00286 / ISBN:https://doi.org/10.48550/arXiv.2401.00286 / Published by ArXiv / Version released on 2023-12-30 / on (web) Publishing site
- Towards Responsible AI in Banking: Addressing Bias for Fair Decision-Making / 2401.08691 / ISBN:https://doi.org/10.48550/arXiv.2401.08691 / Published by ArXiv / Version released on 2024-01-13 / on (web) Publishing site
- Enabling Global Image Data Sharing in the Life Sciences / 2401.13023 / ISBN:https://doi.org/10.48550/arXiv.2401.13023 / Published by ArXiv / Version released on 2024-02-02 / on (web) Publishing site
- A Scoping Study of Evaluation Practices for Responsible AI Tools: Steps Towards Effectiveness Evaluations / 2401.17486 / ISBN:https://doi.org/10.48550/arXiv.2401.17486 / Published by ArXiv / Version released on 2024-01-30 / on (web) Publishing site
- Detecting Multimedia Generated by Large AI Models: A Survey / 2402.00045 / ISBN:https://doi.org/10.48550/arXiv.2402.00045 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site
- (A)I Am Not a Lawyer, But...: Engaging Legal Experts towards Responsible LLM Policies for Legal Advice / 2402.01864 / ISBN:https://doi.org/10.48550/arXiv.2402.01864 / Published by ArXiv / Version released on 2024-02-02 / on (web) Publishing site
- How do machines learn? Evaluating the AIcon2abs method / 2401.07386 / ISBN:https://doi.org/10.48550/arXiv.2401.07386 / Published by ArXiv / Version released on 2026-04-23 / on (web) Publishing site
- Mapping the Ethics of Generative AI: A Comprehensive Scoping Review / 2402.08323 / ISBN:https://doi.org/10.48550/arXiv.2402.08323 / Published by ArXiv / Version released on 2024-02-13 / on (web) Publishing site
- Taking Training Seriously: Human Guidance and Management-Based Regulation of Artificial Intelligence / 2402.08466 / ISBN:https://doi.org/10.48550/arXiv.2402.08466 / Published by ArXiv / Version released on 2024-06-27 / on (web) Publishing site
- Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation / 2402.12590 / ISBN:https://doi.org/10.48550/arXiv.2402.12590 / Published by ArXiv / Version released on 2024-06-18 / on (web) Publishing site
- What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents / 2402.13184 / ISBN:https://doi.org/10.48550/arXiv.2402.13184 / Published by ArXiv / Version released on 2025-01-01 / on (web) Publishing site
- A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site
- A Review of Multi-Modal Large Language and Vision Models / 2404.01322 / ISBN:https://doi.org/10.48550/arXiv.2404.01322 / Published by ArXiv / Version released on 2024-03-28 / on (web) Publishing site
- Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents / 2404.06750 / ISBN:https://arxiv.org/abs/2404.06750 / Published by ArXiv / Version released on 2024-10-18 / on (web) Publishing site
- AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site
- On the role of ethics and sustainability in business innovation / 2404.07678 / ISBN:https://doi.org/10.48550/arXiv.2404.07678 / Published by ArXiv / Version released on 2024-04-11 / on (web) Publishing site
- Debunking Robot Rights Metaphysically, Ethically, and Legally / 2404.10072 / ISBN:https://doi.org/10.48550/arXiv.2404.10072 / Published by ArXiv / Version released on 2024-04-15 / on (web) Publishing site
- Characterizing and modeling harms from interactions with design patterns in AI interfaces / 2404.11370 / ISBN:https://doi.org/10.48550/arXiv.2404.11370 / Published by ArXiv / Version released on 2024-05-20 / on (web) Publishing site
- From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap / 2404.13131 / ISBN:https://doi.org/10.1145/3630106.3658951 / Published by ArXiv / Version released on 2025-08-13 / on (web) Publishing site
- Fairness in AI: challenges in bridging the gap between algorithms and law / 2404.19371 / ISBN:https://doi.org/10.48550/arXiv.2404.19371 / Published by ArXiv / Version released on 2024-04-30 / on (web) Publishing site
- Towards an Ethical and Inclusive Implementation of Artificial Intelligence in Organizations: A Multidimensional Framework / 2405.01697 / ISBN:https://doi.org/10.48550/arXiv.2405.01697 / Published by ArXiv / Version released on 2024-05-02 / on (web) Publishing site
- Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback / 2404.10271 / ISBN:https://doi.org/10.48550/arXiv.2404.10271 / Published by ArXiv / Version released on 2024-06-04 / on (web) Publishing site
- Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models / 2405.07076 / ISBN:https://doi.org/10.48550/arXiv.2405.07076 / Published by ArXiv / Version released on 2024-05-14 / on (web) Publishing site
- When AI Eats Itself: On the Caveats of Data Pollution in the Era of Generative AI
/ 2405.09597 / ISBN:https://doi.org/10.48550/arXiv.2405.09597 / Published by ArXiv / Version released on 2024-11-08 / on (web) Publishing site
- Cyber Risks of Machine Translation Critical Errors : Arabic Mental Health Tweets as a Case Study / 2405.11668 / ISBN:https://doi.org/10.48550/arXiv.2405.11668 / Published by ArXiv / Version released on 2024-05-19 / on (web) Publishing site
- The Narrow Depth and Breadth of Corporate Responsible AI Research / 2405.12193 / ISBN:https://doi.org/10.48550/arXiv.2405.12193 / Published by ArXiv / Version released on 2026-01-28 / on (web) Publishing site
- Towards Clinical AI Fairness: Filling Gaps in the Puzzle / 2405.17921 / ISBN:https://doi.org/10.48550/arXiv.2405.17921 / Published by ArXiv / Version released on 2024-05-28 / on (web) Publishing site
- The Future of Child Development in the AI Era. Cross-Disciplinary Perspectives Between AI and Child Development Experts / 2405.19275 / ISBN:https://doi.org/10.48550/arXiv.2405.19275 / Published by ArXiv / Version released on 2024-05-29 / on (web) Publishing site
- How Ethical Should AI Be? How AI Alignment Shapes the Risk Preferences of LLMs / 2406.01168 / ISBN:https://doi.org/10.48550/arXiv.2406.01168 / Published by ArXiv / Version released on 2024-08-01 / on (web) Publishing site
- Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective / 2406.05724 / ISBN:https://doi.org/10.48550/arXiv.2406.05724 / Published by ArXiv / Version released on 2024-06-11 / on (web) Publishing site
- The Impact of AI on Academic Research and Publishing / 2406.06009 / Published by ArXiv / Version released on 2024-06-10 / on (web) Publishing site
- Applications of Generative AI in Healthcare: algorithmic, ethical, legal and societal considerations / 2406.10632 / ISBN:https://doi.org/10.48550/arXiv.2406.10632 / Published by ArXiv / Version released on 2024-06-15 / on (web) Publishing site
- Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health
/ 2406.13659 / ISBN:https://doi.org/10.48550/arXiv.2406.13659 / Published by ArXiv / Version released on 2024-06-19 / on (web) Publishing site
- AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations / 2406.18346 / ISBN:https://doi.org/10.48550/arXiv.2406.18346 / Published by ArXiv / Version released on 2024-06-26 / on (web) Publishing site
- A Survey on Privacy Attacks Against Digital Twin Systems in AI-Robotics / 2406.18812 / ISBN:https://doi.org/10.48550/arXiv.2406.18812 / Published by ArXiv / Version released on 2024-06-27 / on (web) Publishing site
- Artificial intelligence, rationalization, and the limits of control in the public sector: the case of tax policy optimization / 2407.05336 / ISBN:https://doi.org/10.48550/arXiv.2407.05336 / Published by ArXiv / Version released on 2024-07-07 / on (web) Publishing site
- A Blueprint for Auditing Generative AI / 2407.05338 / ISBN:https://doi.org/10.48550/arXiv.2407.05338 / Published by ArXiv / Version released on 2024-07-07 / on (web) Publishing site
- Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias / 2407.11360 / ISBN:https://doi.org/10.48550/arXiv.2407.11360 / Published by ArXiv / Version released on 2024-07.16 / on (web) Publishing site
- Assurance of AI Systems From a Dependability Perspective / 2407.13948 / ISBN:https://doi.org/10.48550/arXiv.2407.13948 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site
- The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site
- Neuro-Symbolic AI for Military Applications / 2408.09224 / ISBN:https://doi.org/10.48550/arXiv.2408.09224 / Published by ArXiv / Version released on 2024-08-24 / on (web) Publishing site
- Conference Submission and Review Policies to Foster Responsible Computing Research / 2408.09678 / ISBN:https://doi.org/10.48550/arXiv.2408.09678 / Published by ArXiv / Version released on 2024-08-19 / on (web) Publishing site
- CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher / 2408.11650 / ISBN:https://doi.org/10.48550/arXiv.2408.11650 / Published by ArXiv / Version released on 2024-11-06 / on (web) Publishing site
- Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey / 2408.12880 / ISBN:https://doi.org/10.48550/arXiv.2408.12880 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site
- A Survey for Large Language Models in Biomedicine / 2409.00133 / ISBN:https://doi.org/10.48550/arXiv.2409.00133 / Published by ArXiv / Version released on 2024-08-29 / on (web) Publishing site
- On the Creativity of Large Language Models / 2304.00008 / ISBN:https://doi.org/10.48550/arXiv.2304.00008 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site
- LLM generated responses to mitigate the impact of hate speech / 2311.16905 / ISBN:https://doi.org/10.48550/arXiv.2311.16905 / Published by ArXiv / Version released on 2024-10-02 / on (web) Publishing site
- Why business adoption of quantum and AI technology must be ethical / 2312.10081 / ISBN:https://doi.org/10.48550/arXiv.2312.10081 / Published by ArXiv / Version released on 2024-10-08 / on (web) Publishing site
- Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- GenAI Advertising: Risks of Personalizing Ads with LLMs / 2409.15436 / ISBN:https://doi.org/10.48550/arXiv.2409.15436 / Published by ArXiv / Version released on 2024-09-23 / on (web) Publishing site
- Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
/ 2409.16001 / ISBN:https://doi.org/10.48550/arXiv.2409.16001 / Published by ArXiv / Version released on 2025-12-05 / on (web) Publishing site
- Safety challenges of AI in medicine / 2409.18968 / ISBN:https://doi.org/10.48550/arXiv.2409.18968 / Published by ArXiv / Version released on 2024-09-11 / on (web) Publishing site
- From human-centered to social-centered artificial intelligence: Assessing ChatGPT's impact through disruptive events / 2306.00227 / ISBN:https://doi.org/10.48550/arXiv.2306.00227 / Published by ArXiv / Version released on 2024-10-25 / on (web) Publishing site
- Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models
/ 2410.12880 / ISBN:https://doi.org/10.48550/arXiv.2410.12880 / Published by ArXiv / Version released on 2025-01-24 / on (web) Publishing site
- Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML) / 2410.15951 / ISBN:https://doi.org/10.48550/arXiv.2410.15951 / Published by ArXiv / Version released on 2024-10-21 / on (web) Publishing site
- Democratizing Reward Design for Personal and Representative Value-Alignment / 2410.22203 / ISBN:https://doi.org/10.48550/arXiv.2410.22203 / Published by ArXiv / Version released on 2024-10-29 / on (web) Publishing site
- The Transformative Impact of AI and Deep Learning in Business: A Literature Review / 2410.23443 / ISBN:https://doi.org/10.48550/arXiv.2410.23443 / Published by ArXiv / Version released on 2024-10-30 / on (web) Publishing site
- Smoke Screens and Scapegoats: The Reality of General Data Protection Regulation Compliance -- Privacy and Ethics in the Case of Replika AI / 2411.04490 / ISBN:https://doi.org/10.48550/arXiv.2411.04490 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site
- A Comprehensive Review of Multimodal XR Applications, Risks, and Ethical Challenges in the Metaverse / 2411.04508 / ISBN:https://doi.org/10.48550/arXiv.2411.04508 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site
- A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions / 2406.03712 / ISBN:https://doi.org/10.48550/arXiv.2406.03712 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications / 2411.06837 / ISBN:https://doi.org/10.48550/arXiv.2411.06837 / Published by ArXiv / Version released on 2026-04-21 / on (web) Publishing site
- Chat Bankman-Fried: an Exploration of LLM Alignment in Finance / 2411.11853 / ISBN:https://doi.org/10.48550/arXiv.2411.11853 / Published by ArXiv / Version released on 2024-11-21 / on (web) Publishing site
- Towards Foundation-model-based Multiagent System to Accelerate AI for Social Impact / 2412.07880 / ISBN:https://doi.org/10.48550/arXiv.2412.07880 / Published by ArXiv / Version released on 2024-12-12 / on (web) Publishing site
- CERN for AI: A Theoretical Framework for Autonomous Simulation-Based Artificial Intelligence Testing and Alignment / 2312.09402 / ISBN:https://doi.org/10.48550/arXiv.2312.09402 / Published by ArXiv / Version released on 2025-01-06 / on (web) Publishing site
- Reviewing Intelligent Cinematography: AI research for camera-based video production / 2405.05039 / ISBN:https://doi.org/10.48550/arXiv.2405.05039 / Published by ArXiv / Version released on 2025-01-06 / on (web) Publishing site
- Shaping AI's Impact on Billions of Lives / 2412.02730 / ISBN:https://doi.org/10.48550/arXiv.2412.02730 / Published by ArXiv / Version released on 2024-12-11 / on (web) Publishing site
- Responsible AI Governance: A Response to UN Interim Report on Governing AI for Humanity / 2412.12108 / ISBN:https://doi.org/10.48550/arXiv.2412.12108 / Published by ArXiv / Version released on 2024-12-31 / on (web) Publishing site
- User-Generated Content and Editors in Games: A Comprehensive Survey / 2412.13743 / ISBN:https://doi.org/10.48550/arXiv.2412.13743 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site
- Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site
- Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind / 2501.00320 / ISBN:https://doi.org/10.48550/arXiv.2501.00320 / Published by ArXiv / Version released on 2025-01-07 / on (web) Publishing site
- Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors
/ 2501.00957 / ISBN:https://doi.org/10.48550/arXiv.2501.00957 / Published by ArXiv / Version released on 2026-03-10 / on (web) Publishing site
- Curious, Critical Thinker, Empathetic, and Ethically Responsible: Essential Soft Skills for Data Scientists in Software Engineering / 2501.02088 / ISBN:https://doi.org/10.48550/arXiv.2501.02088 / Published by ArXiv / Version released on 2025-01-29 / on (web) Publishing site
- Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto / 2312.01818 / ISBN:https://doi.org/10.48550/arXiv.2312.01818 / Published by ArXiv / Version released on 2025-01-16 / on (web) Publishing site
- A Blockchain-Enabled Approach to Cross-Border Compliance and Trust / 2501.09182 / ISBN:https://doi.org/10.48550/arXiv.2501.09182 / Published by ArXiv / Version released on 2025-01-15 / on (web) Publishing site
- Responsible Generative AI Use by Product Managers: Recoupling Ethical Principles and Practices / 2501.16531 / ISBN:https://doi.org/10.48550/arXiv.2501.16531 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site
- Governing the Agent-to-Agent Economy of Trust via Progressive Decentralization / 2501.16606 / ISBN:https://doi.org/10.48550/arXiv.2501.16606 / Published by ArXiv / Version released on 2025-01-28 / on (web) Publishing site
- A Case Study in Acceleration AI Ethics: The TELUS GenAI Conversational Agent
/ 2501.18038 / ISBN:https://doi.org/10.48550/arXiv.2501.18038 / Published by ArXiv / Version released on 2025-03-26 / on (web) Publishing site
- Agentic AI: Expanding the Algorithmic Frontier of Creative Problem Solving / 2502.00289 / ISBN:https://doi.org/10.48550/arXiv.2502.00289 / Published by ArXiv / Version released on 2025-02-01 / on (web) Publishing site
- Cognitive AI framework 2.0: advances in the simulation of human thought / 2502.04259 / ISBN:https://doi.org/10.48550/arXiv.2502.04259 / Published by ArXiv / Version released on 2026-01-21 / on (web) Publishing site
- Open Foundation Models in Healthcare: Challenges, Paradoxes, and Opportunities with GenAI Driven Personalized Prescription / 2502.04356 / ISBN:https://doi.org/10.48550/arXiv.2502.04356 / Published by ArXiv / Version released on 2025-02-04 / on (web) Publishing site
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site
- The Odyssey of the Fittest: Can Agents Survive and Still Be Good? / 2502.05442 / ISBN:https://doi.org/10.48550/arXiv.2502.05442 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site
- Fairness in Multi-Agent AI: A Unified Framework for Ethical and Equitable Autonomous Systems / 2502.07254 / ISBN:https://doi.org/10.48550/arXiv.2502.07254 / Published by ArXiv / Version released on 2025-02-11 / on (web) Publishing site
- From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
/ 2502.09242 / ISBN:https://doi.org/10.48550/arXiv.2502.09242 / Published by ArXiv / Version released on 2025-02-13 / on (web) Publishing site
- AI and the Transformation of Accountability and Discretion in Urban Governance / 2502.13101 / ISBN:https://doi.org/10.48550/arXiv.2502.13101 / Published by ArXiv / Version released on 2025-04-16 / on (web) Publishing site
- Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site
- On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models / 2502.18505 / ISBN:https://doi.org/10.48550/arXiv.2502.18505 / Published by ArXiv / Version released on 2025-02-21 / on (web) Publishing site
- Developmental Support Approach to AI's Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning / 2502.19798 / ISBN:https://doi.org/10.48550/arXiv.2502.19798 / Published by ArXiv / Version released on 2025-02-27 / on (web) Publishing site
- Mapping out AI Functions in Intelligent Disaster (Mis)Management and AI-Caused Disasters / 2502.16644 / ISBN:https://doi.org/10.48550/arXiv.2502.16644 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site
- DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site
- Advancing Human-Machine Teaming: Concepts, Challenges, and Applications
/ 2503.16518 / ISBN:https://doi.org/10.48550/arXiv.2503.16518 / Published by ArXiv / Version released on 2025-05-06 / on (web) Publishing site
- AI Identity, Empowerment, and Mindfulness in Mitigating Unethical AI Use / 2503.20099 / ISBN:https://doi.org/10.48550/arXiv.2503.20099 / Published by ArXiv / Version released on 2025-03-25 / on (web) Publishing site
- Generative AI and News Consumption: Design Fictions and Critical Analysis / 2503.20391 / ISBN:https://doi.org/10.48550/arXiv.2503.20391 / Published by ArXiv / Version released on 2025-03-26 / on (web) Publishing site
- AI Family Integration Index (AFII): Benchmarking a New Global Readiness for AI as Family / 2503.22772 / ISBN:https://doi.org/10.48550/arXiv.2503.22772 / Published by ArXiv / Version released on 2025-03-28 / on (web) Publishing site
- Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents / 2504.01029 / ISBN:https://doi.org/10.48550/arXiv.2504.01029 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site
- AI Regulation and Capitalist Growth: Balancing Innovation, Ethics, and Global Governance / 2504.02000 / ISBN:https://doi.org/10.48550/arXiv.2504.02000 / Published by ArXiv / Version released on 2025-04-01 / on (web) Publishing site
- Ethical AI on the Waitlist: Group Fairness Evaluation of LLM-Aided Organ Allocation / 2504.03716 / ISBN:https://doi.org/10.48550/arXiv.2504.03716 / Published by ArXiv / Version released on 2025-03-29 / on (web) Publishing site
- Language-Dependent Political Bias in AI: A Study of ChatGPT and Gemini / 2504.06436 / ISBN:https://doi.org/10.48550/arXiv.2504.06436 / Published by ArXiv / Version released on 2025-04-08 / on (web) Publishing site
- We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy / 2504.07936 / ISBN:https://doi.org/10.48550/arXiv.2504.07936 / Published by ArXiv / Version released on 2025-04-10 / on (web) Publishing site
- A Comprehensive Survey on Integrating Large Language Models with Knowledge-Based Methods / 2501.13947 / ISBN:https://doi.org/10.48550/arXiv.2501.13947 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation / 2502.05151 / ISBN:https://doi.org/10.48550/arXiv.2502.05151 / Published by ArXiv / Version released on 2026-03-05 / on (web) Publishing site
- Designing AI-Enabled Countermeasures to Cognitive Warfare / 2504.11486 / ISBN:https://doi.org/10.48550/arXiv.2504.11486 / Published by ArXiv / Version released on 2025-04-14 / on (web) Publishing site
- Framework, Standards, Applications and Best practices of Responsible AI : A Comprehensive Survey / 2504.13979 / ISBN:https://doi.org/10.48550/arXiv.2504.13979 / Published by ArXiv / Version released on 2025-04-18 / on (web) Publishing site
- Auditing the Ethical Logic of Generative AI Models / 2504.17544 / ISBN:https://doi.org/10.48550/arXiv.2504.17544 / Published by ArXiv / Version released on 2025-04-24 / on (web) Publishing site
- Federated learning, ethics, and the double black box problem in medical AI
/ 2504.20656 / ISBN:https://doi.org/10.48550/arXiv.2504.20656 / Published by ArXiv / Version released on 2025-04-29 / on (web) Publishing site
- From Texts to Shields: Convergence of Large Language Models and Cybersecurity / 2505.00841 / ISBN:https://doi.org/10.48550/arXiv.2505.00841 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach / 2505.09576 / ISBN:https://doi.org/10.48550/arXiv.2505.09576 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site
- Sentience Quest: Towards Embodied, Emotionally Adaptive, Self-Evolving, Ethically Aligned Artificial General Intelligence / 2505.12229 / ISBN:https://doi.org/10.48550/arXiv.2505.12229 / Published by ArXiv / Version released on 2025-05-18 / on (web) Publishing site
- AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals / 2505.15365 / ISBN:https://doi.org/10.48550/arXiv.2505.15365 / Published by ArXiv / Version released on 2025-05-21 / on (web) Publishing site
- Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods / 2505.17870 / ISBN:https://doi.org/10.48550/arXiv.2505.17870 / Published by ArXiv / Version released on 2025-05-23 / on (web) Publishing site
- Making Sense of the Unsensible: Reflection, Survey, and Challenges for XAI in Large Language Models Toward Human-Centered AI / 2505.20305 / ISBN:https://doi.org/10.48550/arXiv.2505.20305 / Published by ArXiv / Version released on 2025-05-18 / on (web) Publishing site
- Exploring Societal Concerns and Perceptions of AI: A Thematic Analysis through the Lens of Problem-Seeking / 2505.23930 / ISBN:https://doi.org/10.48550/arXiv.2505.23930 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site
- Locating Risk: Task Designers and the Challenge of Risk Disclosure in RAI Content Work / 2505.24246 / ISBN:https://doi.org/10.48550/arXiv.2505.24246 / Published by ArXiv / Version released on 2026-03-31 / on (web) Publishing site
- Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
/ 2506.00415 / ISBN:https://doi.org/10.48550/arXiv.2506.00415 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site
- DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models / 2506.01257 / ISBN:https://doi.org/10.48550/arXiv.2506.01257 / Published by ArXiv / Version released on 2025-06-02 / on (web) Publishing site
- HADA: Human-AI Agent Decision Alignment Architecture / 2506.04253 / ISBN:https://doi.org/10.48550/arXiv.2506.04253 / Published by ArXiv / Version released on 2025-06-01 / on (web) Publishing site
- Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe? / 2506.11945 / ISBN:https://doi.org/10.48550/arXiv.2506.11945 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site
- I Hadn't Thought About That: Creators of Human-like AI Weigh in on Ethics And Neurodivergence / 2506.12098 / ISBN:https://doi.org/10.48550/arXiv.2506.12098 / Published by ArXiv / Version released on 2025-06-12 / on (web) Publishing site
- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site
- Constitutive Components for Human-Like Autonomous Artificial Intelligence / 2506.12952 / ISBN:https://doi.org/10.48550/arXiv.2506.12952 / Published by ArXiv / Version released on 2025-06-15 / on (web) Publishing site
- Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs / 2506.13082 / ISBN:https://doi.org/10.48550/arXiv.2506.13082 / Published by ArXiv / Version released on 2026-03-06 / on (web) Publishing site
- Foundation of Affective Computing and Interaction
/ 2506.15497 / ISBN:https://doi.org/10.48550/arXiv.2506.15497 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site
- Making the Right Thing: Bridging HCI and Responsible AI in Early-Stage AI Concept Selection / 2506.17494 / ISBN:https://doi.org/10.48550/arXiv.2506.17494 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site
- Adapting University Policies for Generative AI: Opportunities, Challenges, and Policy Solutions in Higher Education / 2506.22231 / ISBN:https://doi.org/10.48550/arXiv.2506.22231 / Published by ArXiv / Version released on 2025-06-27 / on (web) Publishing site
- Moral Responsibility or Obedience: What Do We Want from AI? / 2507.02788 / ISBN:https://doi.org/10.48550/arXiv.2507.02788 / Published by ArXiv / Version released on 2025-07-03 / on (web) Publishing site
- Strategic Alignment Patterns in National AI Policies / 2507.05400 / ISBN:https://doi.org/10.48550/arXiv.2507.05400 / Published by ArXiv / Version released on 2025-07-07 / on (web) Publishing site
- AI Human Impact: Toward a Model for Ethical Investing in AI-Intensive Companies / 2507.07703 / ISBN:https://doi.org/10.48550/arXiv.2507.07703 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site
- When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site
- Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics / 2506.12365 / ISBN:https://doi.org/10.48550/arXiv.2506.12365 / Published by ArXiv / Version released on 2025-07-31 / on (web) Publishing site
- The AI Ethical Resonance Hypothesis: The Possibility of Discovering Moral Meta-Patterns in AI Systems / 2507.11552 / ISBN:https://doi.org/10.48550/arXiv.2507.11552 / Published by ArXiv / Version released on 2025-07-13 / on (web) Publishing site
- The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- Challenges of Trustworthy Federated Learning: What's Done, Current Trends and Remaining Work / 2507.15796 / ISBN:https://doi.org/10.48550/arXiv.2507.15796 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site
- ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site
- Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation / 2507.15901 / ISBN:https://doi.org/10.48550/arXiv.2507.15901 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site
- PRAC3 (Privacy, Reputation, Accountability, Consent, Credit, Compensation): Long Tailed Risks of Voice Actors in AI Data-Economy / 2507.16247 / ISBN:https://doi.org/10.48550/arXiv.2507.16247 / Published by ArXiv / Version released on 2025-07-22 / on (web) Publishing site
- Strategic Motivators for Ethical AI System Development: An Empirical and Holistic Model / 2507.20218 / ISBN:https://doi.org/10.48550/arXiv.2507.20218 / Published by ArXiv / Version released on 2025-07-25 / on (web) Publishing site
- Rethinking Evidence Hierarchies in Medical Language Benchmarks: A Critical Evaluation of HealthBench / 2508.00081 / ISBN:https://doi.org/10.48550/arXiv.2508.00081 / Published by ArXiv / Version released on 2025-07-31 / on (web) Publishing site
- Generative AI as a Geopolitical Factor in Industry 5.0: Sovereignty, Access, and Control / 2508.00973 / ISBN:https://doi.org/10.48550/arXiv.2508.00973 / Published by ArXiv / Version released on 2025-08-01 / on (web) Publishing site
- The Silicon Reasonable Person: Can AI Predict How Ordinary People Judge Reasonableness? / 2508.02766 / ISBN:https://doi.org/10.48550/arXiv.2508.02766 / Published by ArXiv / Version released on 2025-08-04 / on (web) Publishing site
- Development of management systems using artificial intelligence systems and machine learning methods for boards of directors (preprint, unofficial translation) / 2508.03769 / ISBN:https://doi.org/10.48550/arXiv.2508.03769 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site
- The Fair Game: Auditing & Debiasing AI Algorithms Over Time / 2508.06443 / ISBN:https://doi.org/10.48550/arXiv.2508.06443 / Published by ArXiv / Version released on 2025-08-08 / on (web) Publishing site
- A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site
- A Moral Agency Framework for Legitimate Integration of AI in Bureaucracies / 2508.08231 / ISBN:https://doi.org/10.48550/arXiv.2508.08231 / Published by ArXiv / Version released on 2025-08-21 / on (web) Publishing site
- Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- A Comprehensive Review of Datasets for Clinical Mental Health AI Systems / 2508.09809 / ISBN:https://doi.org/10.48550/arXiv.2508.09809 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- Artificial Emotion: A Survey of Theories and Debates on Realising Emotion in Artificial Intelligence / 2508.10286 / ISBN:https://doi.org/10.48550/arXiv.2508.10286 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- An Intelligent Infrastructure as a Foundation for Modern Science / 2508.10051 / ISBN:https://doi.org/10.48550/arXiv.2508.10051 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond / 2508.11957 / ISBN:https://doi.org/10.48550/arXiv.2508.11957 / Published by ArXiv / Version released on 2025-08-16 / on (web) Publishing site
- The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site
- Augmentation Technologies and AI - An Ethical Design Futures Framework / 2508.16615 / ISBN:https://doi.org/10.48550/arXiv.2508.16615 / Published by ArXiv / Version released on 2025-08-13 / on (web) Publishing site
- AI as IA: The use and abuse of artificial intelligence (AI) for human enhancement through intellectual augmentation (IA) / 2508.16642 / ISBN:https://doi.org/10.48550/arXiv.2508.16642 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- Socially Interactive Agents for Preserving and Transferring Tacit Knowledge in Organizations / 2508.19942 / ISBN:https://doi.org/10.48550/arXiv.2508.19942 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site
- Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science / 2508.20674 / ISBN:https://doi.org/10.48550/arXiv.2508.20674 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site
- Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI / 2508.21101 / ISBN:https://doi.org/10.48550/arXiv.2508.21101 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site
- Designing LMS and Instructional Strategies for Integrating Generative-Conversational AI / 2509.00709 / ISBN:https://doi.org/10.48550/arXiv.2509.00709 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site
- Structured AI Decision-Making in Disaster Management / 2509.01576 / ISBN:https://doi.org/10.48550/arXiv.2509.01576 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site
- Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site
- AI Governance in Higher Education: A course design exploring regulatory, ethical and practical considerationsAI Governance in Higher Education: A course design exploring regulatory, ethical and practical considerations / 2509.06176 / ISBN:https://doi.org/10.48550/arXiv.2509.06176 / Published by ArXiv / Version released on 2025-09-16 / on (web) Publishing site
- ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code / 2509.07006 / ISBN:https://doi.org/10.48550/arXiv.2509.07006 / Published by ArXiv / Version released on 2025-09-06 / on (web) Publishing site
- Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned / 2509.08852 / ISBN:https://doi.org/10.48550/arXiv.2509.08852 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site
- The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? / 2506.01813 / ISBN:https://doi.org/10.48550/arXiv.2506.01813 / Published by ArXiv / Version released on 2025-09-29 / on (web) Publishing site
- Web3 x AI Agents: Landscape, Integrations, and Foundational Challenges / 2508.02773 / ISBN:https://doi.org/10.48550/arXiv.2508.02773 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site
- Understanding the Process of Human-AI Value Alignment / 2509.13854 / ISBN:https://doi.org/10.48550/arXiv.2509.13854 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site
- Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site
- Trust and Transparency in AI: Industry Voices on Data, Ethics, and Compliance / 2509.22709 / ISBN:https://doi.org/10.48550/arXiv.2509.22709 / Published by ArXiv / Version released on 2025-09-23 / on (web) Publishing site
- Reconsidering Requirements Engineering: Human-AI Collaboration in AI-Native Software Development / 2510.04380 / ISBN:https://doi.org/10.1007/978-3-032-04190-6_11 / Published by ArXiv / Version released on 2025-10-05 / on (web) Publishing site
- Fully Autonomous AI Agents Should Not be Developed / 2502.02649 / ISBN:https://doi.org/10.48550/arXiv.2502.02649 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site
- The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs
/ 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site
- A New Digital Divide? Coder Worldviews, the Slop Economy, and Democracy in the Age of AI / 2510.04755 / ISBN:https://doi.org/10.48550/arXiv.2510.04755 / Published by ArXiv / Version released on 2025-12-17 / on (web) Publishing site
- Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles / 2510.21293 / ISBN:https://doi.org/10.48550/arXiv.2510.21293 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site
- Making Power Explicable in AI: Analyzing, Understanding, and Redirecting Power to Operationalize Ethics in AI Technical Practice / 2510.10588 / ISBN:https://doi.org/10.48550/arXiv.2510.10588 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site
- AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site
- How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site
- Diverse Human Value Alignment for Large Language Models via Ethical Reasoning / 2511.00379 / ISBN:https://doi.org/10.48550/arXiv.2511.00379 / Published by ArXiv / Version released on 2025-11-01 / on (web) Publishing site
- Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2026-01-23 / on (web) Publishing site
- People Perceive More Phantom Costs From Autonomous Agents When They Make Unreasonably Generous Offers / 2511.07401 / ISBN:https://doi.org/10.48550/arXiv.2511.07401 / Published by ArXiv / Version released on 2025-11-10 / on (web) Publishing site
- SciSciGPT: Advancing Human-AI Collaboration in the Science of Science / 2504.05559 / ISBN:https://doi.org/10.48550/arXiv.2504.05559 / Published by ArXiv / Version released on 2025-11-27 / on (web) Publishing site
- Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- Cross-cultural value alignment frameworks for responsible AI governance: Evidence from China-West comparative analysis / 2511.17256 / ISBN:https://doi.org/10.48550/arXiv.2511.17256 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- Towards Synergistic Teacher-AI Interactions with Generative Artificial Intelligence / 2511.19580 / ISBN:https://doi.org/10.48550/arXiv.2511.19580 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site
- Morality in AI. A plea to embed morality in LLM architectures and frameworks / 2511.20689 / ISBN:https://doi.org/10.48550/arXiv.2511.20689 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- A Human-centric Framework for Debating the Ethics of AI Consciousness Under Uncertainty
/ 2512.02544 / ISBN:https://doi.org/10.48550/arXiv.2512.02544 / Published by ArXiv / Version released on 2025-12-02 / on (web) Publishing site
- From Challenge to Change: Design Principles for AI Transformations
/ 2512.05533 / ISBN:https://doi.org/10.48550/arXiv.2512.05533 / Published by ArXiv / Version released on 2025-12-05 / on (web) Publishing site
- Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research / 2512.10058 / ISBN:https://doi.org/10.48550/arXiv.2512.10058 / Published by ArXiv / Version released on 2025-12-10 / on (web) Publishing site
- The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective
/ 2512.17989 / ISBN:https://arxiv.org/abs/2512.17989 / Published by ArXiv / Version released on 2026-02-25 / on (web) Publishing site
- Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site
- Research Integrity and Academic Authority in the Age of Artificial Intelligence: From Discovery to Curation? / 2601.05574 / ISBN:https://doi.org/10.48550/arXiv.2601.05574 / Published by ArXiv / Version released on 2026-01-09 / on (web) Publishing site
- Epistemic Constitutionalism Or: how to avoid coherence bias / 2601.14295 / ISBN:https://doi.org/10.48550/arXiv.2601.14295 / Published by ArXiv / Version released on 2026-04-22 / on (web) Publishing site
- Reimagining Legal Fact Verification with GenAI: Toward Effective Human-AI Collaboration / 2602.06305 / ISBN:https://doi.org/10.48550/arXiv.2602.06305 / Published by ArXiv / Version released on 2026-02-09 / on (web) Publishing site
- Conversational AI for Social Good (CAI4SG): An Overview of Emerging Trends, Applications, and Challenges
/ 2601.15136 / ISBN:https://doi.org/10.48550/arXiv.2601.15136 / Published by ArXiv / Version released on 2026-01-21 / on (web) Publishing site
- AI-RP: The AI Relationship Process Framework / 2601.17351 / ISBN:https://doi.org/10.48550/arXiv.2601.17351 / Published by ArXiv / Version released on 2026-01-24 / on (web) Publishing site
- Unsupervised Elicitation of Moral Values from Language Models / 2601.17728 / ISBN:https://doi.org/10.48550/arXiv.2601.17728 / Published by ArXiv / Version released on 2026-01-25 / on (web) Publishing site
- Beyond Abstract Compliance: Operationalising trust in AI as a moral relationship / 2601.22769 / ISBN:https://doi.org/10.48550/arXiv.2601.22769 / Published by ArXiv / Version released on 2026-01-30 / on (web) Publishing site
- Human Society-Inspired Approaches to Agentic AI Security: The 4C Framework / 2602.01942 / ISBN:https://doi.org/10.48550/arXiv.2602.01942 / Published by ArXiv / Version released on 2026-02-02 / on (web) Publishing site
- Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site
- Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site
- CogniAlign: Survivability-Grounded Multi-Agent Moral Reasoning for Safe and Transparent AI / 2509.13356 / ISBN:https://doi.org/10.48550/arXiv.2509.13356 / Version released on 2026-02-21 / on (web) Publishing site
- Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models / 2602.19124 / ISBN:https://doi.org/10.48550/arXiv.2602.19124 / Version released on 2026-02-22 / on (web) Publishing site
- Personal Data as a Human Right: A New Social Contract Based on Data Sovereignty, Human Dignity and Data Personalism / 2602.23918 / ISBN:https://doi.org/10.48550/arXiv.2602.23918 / Version released on 2026-02-27 / on (web) Publishing site
- Building the ethical AI framework of the future: from philosophy to practice
/ 2603.06599 / ISBN:https://doi.org/10.48550/arXiv.2603.06599 / Version released on 2026-02-16 / on (web) Publishing site
- Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Version released on 2026-03-23 / on (web) Publishing site
- Bridging the Gap in the Responsible AI Divides
/ 2603.14495 / ISBN:https://doi.org/10.48550/arXiv.2603.14495 / Version released on 2026-03-15 / on (web) Publishing site
- Narrative Frames: A New Approach to Analysing Metaphors in AI Ethics and Policy Discourse
/ 2603.17192 / ISBN:https://doi.org/10.48550/arXiv.2603.17192 / Version released on 2026-03-17 / on (web) Publishing site
- Ghosting the Machine: Stop Calling Human-Agent Relations Parasocial / 2604.05197 / ISBN:https://doi.org/10.48550/arXiv.2604.05197 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site
- AI Integrity: A New Paradigm for Verifiable AI Governance / 2604.11065 / ISBN:https://doi.org/10.48550/arXiv.2604.11065 / Version released on 2026-04-13 / on (web) Publishing site
- Strategic Polysemy in AI Discourse: A Philosophical Analysis of Language, Hype, and Power / 2604.21043 / ISBN:https://doi.org/10.48550/arXiv.2604.21043 / Version released on 2026-04-22 / on (web) Publishing site
- Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure
/ 2605.00055 / ISBN:https://doi.org/10.48550/arXiv.2605.00055 / Version released on 2026-04-29 / on (web) Publishing site
- Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Version released on 2026-05-07 / on (web) Publishing site
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey / 2505.00753 / ISBN:https://doi.org/10.48550/arXiv.2505.00753 / Version released on 2026-05-06 / on (web) Publishing site
- Co-Constructing Alignment: A Participatory Approach to Situate AI Values / 2601.15895 / ISBN:https://doi.org/10.48550/arXiv.2601.15895 / Version released on 2026-04-21 / on (web) Publishing site
_