RobertoLofaro.com - Knowledge Portal - human-generated content

AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13


Tag: defense

Bibliography items where the tag occurs: 202
The AI Index 2022 Annual Report / 2205.03468 / ISBN:https://doi.org/10.48550/arXiv.2205.03468 / Published by ArXiv / Version released on 2022-05-02 / on (web) Publishing site


AI Ethics Issues in Real World: Evidence from AI Incident Database / 2206.07635 / ISBN:https://doi.org/10.48550/arXiv.2206.07635 / Published by ArXiv / Version released on 2022-08-18 / on (web) Publishing site


The Different Faces of AI Ethics Across the World: A Principle-Implementation Gap Analysis / 2206.03225 / ISBN:https://doi.org/10.48550/arXiv.2206.03225 / Published by ArXiv / Version released on 2022-05-12 / on (web) Publishing site


Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance / 2206.11922 / ISBN:https://doi.org/10.48550/arXiv.2206.11922 / Published by ArXiv / Version released on 2024-02-19 / on (web) Publishing site


From OECD to India: Exploring cross-cultural differences in perceived trust, responsibility and reliance of AI and human experts / 2307.15452 / ISBN:https://doi.org/10.48550/arXiv.2307.15452 / Published by ArXiv / Version released on 2023-07-28 / on (web) Publishing site


The Ethics of AI Value Chains / 2307.16787 / ISBN:https://doi.org/10.48550/arXiv.2307.16787 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site


From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence / 2308.02448 / ISBN:https://doi.org/10.48550/arXiv.2308.02448 / Published by ArXiv / Version released on 2023-08-04 / on (web) Publishing site


Ethical Considerations and Policy Implications for Large Language Models: Guiding Responsible Development and Deployment / 2308.02678 / ISBN:https://doi.org/10.48550/arXiv.2308.02678 / Published by ArXiv / Version released on 2023-08-01 / on (web) Publishing site


Dual Governance: The intersection of centralized regulation and crowdsourced safety mechanisms for Generative AI / 2308.04448 / ISBN:https://doi.org/10.48550/arXiv.2308.04448 / Published by ArXiv / Version released on 2023-08-02 / on (web) Publishing site


Bad, mad, and cooked: Moral responsibility for civilian harms in human-AI military teams / 2211.06326 / ISBN:https://doi.org/10.48550/arXiv.2211.06326 / Published by ArXiv / Version released on 2023-09-06 / on (web) Publishing site


The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site


The AI Revolution: Opportunities and Challenges for the Finance Sector / 2308.16538 / ISBN:https://doi.org/10.48550/arXiv.2308.16538 / Published by ArXiv / Version released on 2023-08-31 / on (web) Publishing site


Security Considerations in AI-Robotics: A Survey of Current Methods, Challenges, and Opportunities / 2310.08565 / ISBN:https://doi.org/10.48550/arXiv.2310.08565 / Published by ArXiv / Version released on 2024-01-26 / on (web) Publishing site


A Review of the Ethics of Artificial Intelligence and its Applications in the United States / 2310.05751 / ISBN:https://doi.org/10.48550/arXiv.2310.05751 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site


Autonomous Vehicles an overview on system, cyber security, risks, issues, and a way forward / 2309.14213 / ISBN:https://doi.org/10.48550/arXiv.2309.14213 / Published by ArXiv / Version released on 2023-09-25 / on (web) Publishing site


An Evaluation of GPT-4 on the ETHICS Dataset / 2309.10492 / ISBN:https://doi.org/10.48550/arXiv.2309.10492 / Published by ArXiv / Version released on 2023-09-19 / on (web) Publishing site


FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare / 2309.12325 / ISBN:https://doi.org/10.48550/arXiv.2309.12325 / Published by ArXiv / Version released on 2024-07-08 / on (web) Publishing site


Specific versus General Principles for Constitutional AI / 2310.13798 / ISBN:https://doi.org/10.48550/arXiv.2310.13798 / Published by ArXiv / Version released on 2023-10-20 / on (web) Publishing site


Artificial Intelligence Ethics Education in Cybersecurity: Challenges and Opportunities: a focus group report / 2311.00903 / ISBN:https://doi.org/10.48550/arXiv.2311.00903 / Published by ArXiv / Version released on 2023-11-02 / on (web) Publishing site


Human participants in AI research: Ethics and transparency in practice / 2311.01254 / ISBN:https://doi.org/10.48550/arXiv.2311.01254 / Published by ArXiv / Version released on 2024-09-26 / on (web) Publishing site


Educating for AI Cybersecurity Work and Research: Ethics, Systems Thinking, and Communication Requirements / 2311.04326 / ISBN:https://doi.org/10.48550/arXiv.2311.04326 / Published by ArXiv / Version released on 2023-11-07 / on (web) Publishing site


Safety, Trust, and Ethics Considerations for Human-AI Teaming in Aerospace Control / 2311.08943 / ISBN:https://doi.org/10.48550/arXiv.2311.08943 / Published by ArXiv / Version released on 2023-11-15 / on (web) Publishing site


Generative AI and US Intellectual Property Law / 2311.16023 / ISBN:https://doi.org/10.48550/arXiv.2311.16023 / Published by ArXiv / Version released on 2023-11-27 / on (web) Publishing site


Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site


Deepfakes, Misinformation, and Disinformation in the Era of Frontier AI, Generative AI, and Large AI Models / 2311.17394 / ISBN:https://doi.org/10.48550/arXiv.2311.17394 / Published by ArXiv / Version released on 2023-11-29 / on (web) Publishing site


Intelligence Primer / 2008.07324 / ISBN:https://doi.org/10.48550/arXiv.2008.07324 / Published by ArXiv / Version released on 2025-09-03 / on (web) Publishing site


Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates / 2312.06861 / ISBN:https://doi.org/10.48550/arXiv.2312.06861 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site


Autonomous Threat Hunting: A Future Paradigm for AI-Driven Threat Intelligence / 2401.00286 / ISBN:https://doi.org/10.48550/arXiv.2401.00286 / Published by ArXiv / Version released on 2023-12-30 / on (web) Publishing site


MULTI-CASE: A Transformer-based Ethics-aware Multimodal Investigative Intelligence Framework / 2401.01955 / ISBN:https://doi.org/10.48550/arXiv.2401.01955 / Published by ArXiv / Version released on 2024-01-03 / on (web) Publishing site


Detecting Multimedia Generated by Large AI Models: A Survey / 2402.00045 / ISBN:https://doi.org/10.48550/arXiv.2402.00045 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site


Commercial AI, Conflict, and Moral Responsibility: A theoretical analysis and practical approach to the moral responsibilities associated with dual-use AI technology / 2402.01762 / ISBN:https://doi.org/10.48550/arXiv.2402.01762 / Published by ArXiv / Version released on 2024-01-30 / on (web) Publishing site


Ethics in AI through the Practitioner's View: A Grounded Theory Literature Review / 2206.09514 / ISBN:https://doi.org/10.48550/arXiv.2206.09514 / Published by ArXiv / Version released on 2024-02-20 / on (web) Publishing site


What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents / 2402.13184 / ISBN:https://doi.org/10.48550/arXiv.2402.13184 / Published by ArXiv / Version released on 2025-01-01 / on (web) Publishing site


Towards an AI-Enhanced Cyber Threat Intelligence Processing Pipeline / 2403.03265 / ISBN:https://doi.org/10.48550/arXiv.2403.03265 / Published by ArXiv / Version released on 2024-03-05 / on (web) Publishing site


A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site


Responsible Artificial Intelligence: A Structured Literature Review / 2403.06910 / ISBN:https://doi.org/10.48550/arXiv.2403.06910 / Published by ArXiv / Version released on 2024-03-11 / on (web) Publishing site


Towards a Privacy and Security-Aware Framework for Ethical AI: Guiding the Development and Assessment of AI Systems / 2403.08624 / ISBN:https://doi.org/10.48550/arXiv.2403.08624 / Published by ArXiv / Version released on 2024-03-13 / on (web) Publishing site


Review of Generative AI Methods in Cybersecurity / 2403.08701 / ISBN:https://doi.org/10.48550/arXiv.2403.08701 / Published by ArXiv / Version released on 2024-03-19 / on (web) Publishing site


Trust in AI: Progress, Challenges, and Future Directions / 2403.14680 / ISBN:https://doi.org/10.48550/arXiv.2403.14680 / Published by ArXiv / Version released on 2024-04-04 / on (web) Publishing site


AI Ethics: A Bibliometric Analysis, Critical Issues, and Key Gaps / 2403.14681 / ISBN:https://doi.org/10.48550/arXiv.2403.14681 / Published by ArXiv / Version released on 2024-03-12 / on (web) Publishing site


The Journey to Trustworthy AI- Part 1 Pursuit of Pragmatic Frameworks / 2403.15457 / ISBN:https://doi.org/10.48550/arXiv.2403.15457 / Published by ArXiv / Version released on 2024-04-06 / on (web) Publishing site


AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site


Epistemic Power in AI Ethics Labor: Legitimizing Located Complaints / 2402.08171 / ISBN:https://doi.org/10.1145/3630106.3658973 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site


Debunking Robot Rights Metaphysically, Ethically, and Legally / 2404.10072 / ISBN:https://doi.org/10.48550/arXiv.2404.10072 / Published by ArXiv / Version released on 2024-04-15 / on (web) Publishing site


Taxonomy to Regulation: A (Geo)Political Taxonomy for AI Risks and Regulatory Measures in the EU AI Act / 2404.11476 / ISBN:https://doi.org/10.48550/arXiv.2404.11476 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site


Large Language Model Supply Chain: A Research Agenda / 2404.12736 / ISBN:https://doi.org/10.48550/arXiv.2404.12736 / Published by ArXiv / Version released on 2024-04-19 / on (web) Publishing site


The Necessity of AI Audit Standards Boards / 2404.13060 / ISBN:https://doi.org/10.48550/arXiv.2404.13060 / Published by ArXiv / Version released on 2024-04-11 / on (web) Publishing site


Who Followed the Blueprint? Analyzing the Responses of U.S. Federal Agencies to the Blueprint for an AI Bill of Rights / 2404.19076 / ISBN:https://doi.org/10.48550/arXiv.2404.19076 / Published by ArXiv / Version released on 2024-04-29 / on (web) Publishing site


War Elephants: Rethinking Combat AI and Human Oversight / 2404.19573 / ISBN:https://doi.org/10.48550/arXiv.2404.19573 / Published by ArXiv / Version released on 2024-04-30 / on (web) Publishing site


A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law / 2405.01769 / ISBN:https://doi.org/10.48550/arXiv.2405.01769 / Published by ArXiv / Version released on 2024-11-21 / on (web) Publishing site


AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research / 2405.01859 / ISBN:https://doi.org/10.48550/arXiv.2405.01859 / Published by ArXiv / Version released on 2024-05-31 / on (web) Publishing site


Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness / 2405.05930 / ISBN:https://doi.org/10.48550/arXiv.2405.05930 / Published by ArXiv / Version released on 2024-05-09 / on (web) Publishing site


XXAI: Towards eXplicitly eXplainable Artificial Intelligence / 2401.03093 / ISBN:https://doi.org/10.48550/arXiv.2401.03093 / Published by ArXiv / Version released on 2024-05-19 / on (web) Publishing site


A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions / 2405.14487 / ISBN:https://doi.org/10.48550/arXiv.2405.14487 / Published by ArXiv / Version released on 2024-05-23 / on (web) Publishing site


Responsible AI for Earth Observation / 2405.20868 / ISBN:https://doi.org/10.48550/arXiv.2405.20868 / Published by ArXiv / Version released on 2024-05-31 / on (web) Publishing site


Gender Bias Detection in Court Decisions: A Brazilian Case Study / 2406.00393 / ISBN:https://doi.org/10.48550/arXiv.2406.00393 / Published by ArXiv / Version released on 2024-06-01 / on (web) Publishing site


MoralBench: Moral Evaluation of LLMs / 2406.04428 / Published by ArXiv / Version released on 2025-07-04 / on (web) Publishing site


Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective / 2406.05724 / ISBN:https://doi.org/10.48550/arXiv.2406.05724 / Published by ArXiv / Version released on 2024-06-11 / on (web) Publishing site


Federated Learning driven Large Language Models for Swarm Intelligence: A Survey / 2406.09831 / ISBN:https://doi.org/10.48550/arXiv.2406.09831 / Published by ArXiv / Version released on 2024-06-14 / on (web) Publishing site


Current state of LLM Risks and AI Guardrails / 2406.12934 / ISBN:https://doi.org/10.48550/arXiv.2406.12934 / Published by ArXiv / Version released on 2024-06-16 / on (web) Publishing site


A Survey on Privacy Attacks Against Digital Twin Systems in AI-Robotics / 2406.18812 / ISBN:https://doi.org/10.48550/arXiv.2406.18812 / Published by ArXiv / Version released on 2024-06-27 / on (web) Publishing site


SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest / 2407.01110 / ISBN:https://doi.org/10.48550/arXiv.2407.01110 / Published by ArXiv / Version released on 2024-07-01 / on (web) Publishing site


A Blueprint for Auditing Generative AI / 2407.05338 / ISBN:https://doi.org/10.48550/arXiv.2407.05338 / Published by ArXiv / Version released on 2024-07-07 / on (web) Publishing site


Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias / 2407.11360 / ISBN:https://doi.org/10.48550/arXiv.2407.11360 / Published by ArXiv / Version released on 2024-07-16 / on (web) Publishing site


Assurance of AI Systems From a Dependability Perspective / 2407.13948 / ISBN:https://doi.org/10.48550/arXiv.2407.13948 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site


AI-Driven Chatbot for Intrusion Detection in Edge Networks: Enhancing Cybersecurity with Ethical User Consent / 2408.04281 / ISBN:https://doi.org/10.48550/arXiv.2408.04281 / Published by ArXiv / Version released on 2024-08-08 / on (web) Publishing site


Between Copyright and Computer Science: The Law and Ethics of Generative AI / 2403.14653 / ISBN:https://doi.org/10.48550/arXiv.2403.14653 / Published by ArXiv / Version released on 2024-09-05 / on (web) Publishing site


The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site


Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives / 2407.14962 / ISBN:https://doi.org/10.48550/arXiv.2407.14962 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site


Neuro-Symbolic AI for Military Applications / 2408.09224 / ISBN:https://doi.org/10.48550/arXiv.2408.09224 / Published by ArXiv / Version released on 2024-08-24 / on (web) Publishing site


Conference Submission and Review Policies to Foster Responsible Computing Research / 2408.09678 / ISBN:https://doi.org/10.48550/arXiv.2408.09678 / Published by ArXiv / Version released on 2024-08-19 / on (web) Publishing site


Don't Kill the Baby: The Case for AI in Arbitration / 2408.11608 / ISBN:https://doi.org/10.48550/arXiv.2408.11608 / Published by ArXiv / Version released on 2025-03-22 / on (web) Publishing site


Promises and challenges of generative artificial intelligence for human learning / 2408.12143 / ISBN:https://doi.org/10.48550/arXiv.2408.12143 / Published by ArXiv / Version released on 2024-09-05 / on (web) Publishing site


Catalog of General Ethical Requirements for AI Certification / 2408.12289 / ISBN:https://doi.org/10.48550/arXiv.2408.12289 / Published by ArXiv / Version released on 2024-11-15 / on (web) Publishing site


Is Generative AI the Next Tactical Cyber Weapon For Threat Actors? Unforeseen Implications of AI Generated Cyber Attacks / 2408.12806 / ISBN:https://doi.org/10.48550/arXiv.2408.12806 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site


On the Creativity of Large Language Models / 2304.00008 / ISBN:https://doi.org/10.48550/arXiv.2304.00008 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site


Social Media Bot Policies: Evaluating Passive and Active Enforcement / 2409.18931 / ISBN:https://doi.org/10.48550/arXiv.2409.18931 / Published by ArXiv / Version released on 2024-09-27 / on (web) Publishing site


Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure / 2409.19104 / ISBN:https://doi.org/10.48550/arXiv.2409.19104 / Published by ArXiv / Version released on 2024-09-27 / on (web) Publishing site


Trust or Bust: Ensuring Trustworthiness in Autonomous Weapon Systems / 2410.10284 / ISBN:https://doi.org/10.48550/arXiv.2410.10284 / Published by ArXiv / Version released on 2024-10-21 / on (web) Publishing site


How Do AI Companies Fine-Tune Policy? Examining Regulatory Capture in AI Governance / 2410.13042 / ISBN:https://doi.org/10.48550/arXiv.2410.13042 / Published by ArXiv / Version released on 2024-10-16 / on (web) Publishing site


Data Defenses Against Large Language Models / 2410.13138 / ISBN:https://doi.org/10.48550/arXiv.2410.13138 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site


Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site


A Simulation System Towards Solving Societal-Scale Manipulation / 2410.13915 / ISBN:https://doi.org/10.48550/arXiv.2410.13915 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site


Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site


Trustworthy XAI and Application / 2410.17139 / ISBN:https://doi.org/10.48550/arXiv.2410.17139 / Published by ArXiv / Version released on 2025-04-16 / on (web) Publishing site


Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements / 2410.17141 / ISBN:https://doi.org/10.48550/arXiv.2410.17141 / Published by ArXiv / Version released on 2025-01-30 / on (web) Publishing site


The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods / 2410.18866 / ISBN:https://doi.org/10.48550/arXiv.2410.18866 / Published by ArXiv / Version released on 2024-10-24 / on (web) Publishing site


A Comprehensive Review of Multimodal XR Applications, Risks, and Ethical Challenges in the Metaverse / 2411.04508 / ISBN:https://doi.org/10.48550/arXiv.2411.04508 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site


How should AI decisions be explained? Requirements for Explanations from the Perspective of European Law / 2404.12762 / ISBN:https://doi.org/10.48550/arXiv.2404.12762 / Published by ArXiv / Version released on 2024-11-26 / on (web) Publishing site


Artificial Intelligence in Cybersecurity: Building Resilient Cyber Diplomacy Frameworks / 2411.13585 / ISBN:https://doi.org/10.48550/arXiv.2411.13585 / Published by ArXiv / Version released on 2024-11-17 / on (web) Publishing site


AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments / 2411.17539 / ISBN:https://doi.org/10.48550/arXiv.2411.17539 / Published by ArXiv / Version released on 2024-11-26 / on (web) Publishing site


Human-centred test and evaluation of military AI / 2412.01978 / ISBN:https://doi.org/10.48550/arXiv.2412.01978 / Published by ArXiv / Version released on 2024-12-02 / on (web) Publishing site


Large Language Models in Politics and Democracy: A Comprehensive Survey / 2412.04498 / ISBN:https://doi.org/10.48550/arXiv.2412.04498 / Published by ArXiv / Version released on 2024-12-16 / on (web) Publishing site


Shaping AI's Impact on Billions of Lives / 2412.02730 / ISBN:https://doi.org/10.48550/arXiv.2412.02730 / Published by ArXiv / Version released on 2024-12-11 / on (web) Publishing site


AI Ethics in Smart Homes: Progress, User Requirements and Challenges / 2412.09813 / ISBN:https://doi.org/10.48550/arXiv.2412.09813 / Published by ArXiv / Version released on 2024-12-13 / on (web) Publishing site


On Large Language Models in Mission-Critical IT Governance: Are We Ready Yet? / 2412.11698 / ISBN:https://doi.org/10.48550/arXiv.2412.11698 / Published by ArXiv / Version released on 2025-01-10 / on (web) Publishing site


Clio: Privacy-Preserving Insights into Real-World AI Use / 2412.13678 / ISBN:https://doi.org/10.48550/arXiv.2412.13678 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site


User-Generated Content and Editors in Games: A Comprehensive Survey / 2412.13743 / ISBN:https://doi.org/10.48550/arXiv.2412.13743 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site


Autonomous Vehicle Security: A Deep Dive into Threat Modeling / 2412.15348 / ISBN:https://doi.org/10.48550/arXiv.2412.15348 / Published by ArXiv / Version released on 2024-12-19 / on (web) Publishing site


Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site


Self-Disclosure to AI: The Paradox of Trust and Vulnerability in Human-Machine Interactions / 2412.20564 / ISBN:https://doi.org/10.48550/arXiv.2412.20564 / Published by ArXiv / Version released on 2024-12-29 / on (web) Publishing site


A Blockchain-Enabled Approach to Cross-Border Compliance and Trust / 2501.09182 / ISBN:https://doi.org/10.48550/arXiv.2501.09182 / Published by ArXiv / Version released on 2025-01-15 / on (web) Publishing site


Uncovering Bias in Foundation Models: Impact, Testing, Harm, and Mitigation / 2501.10453 / ISBN:https://doi.org/10.48550/arXiv.2501.10453 / Published by ArXiv / Version released on 2025-01-14 / on (web) Publishing site


The Third Moment of AI Ethics: Developing Relatable and Contextualized Tools / 2501.16954 / ISBN:https://doi.org/10.48550/arXiv.2501.16954 / Published by ArXiv / Version released on 2025-01-28 / on (web) Publishing site


Towards Safe AI Clinicians: A Comprehensive Study on Large Language Model Jailbreaking in Healthcare / 2501.18632 / ISBN:https://doi.org/10.48550/arXiv.2501.18632 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site


Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site


On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


Transforming Cyber Defense: Harnessing Agentic and Frontier AI for Proactive, Ethical Threat Intelligence / 2503.00164 / ISBN:https://doi.org/10.48550/arXiv.2503.00164 / Published by ArXiv / Version released on 2025-02-28 / on (web) Publishing site


Decoding the Black Box: Integrating Moral Imagination with Technical AI Governance / 2503.06411 / ISBN:https://doi.org/10.48550/arXiv.2503.06411 / Published by ArXiv / Version released on 2025-03-09 / on (web) Publishing site


AI Governance InternationaL Evaluation Index (AGILE Index) 2024 / 2502.15859 / ISBN:https://doi.org/10.48550/arXiv.2502.15859 / Published by ArXiv / Version released on 2025-07-17 / on (web) Publishing site


Mapping out AI Functions in Intelligent Disaster (Mis)Management and AI-Caused Disasters / 2502.16644 / ISBN:https://doi.org/10.48550/arXiv.2502.16644 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site


A Peek Behind the Curtain: Using Step-Around Prompt Engineering to Identify Bias and Misinformation in GenAI Models / 2503.15205 / ISBN:https://doi.org/10.48550/arXiv.2503.15205 / Published by ArXiv / Version released on 2026-01-22 / on (web) Publishing site


Advancing Human-Machine Teaming: Concepts, Challenges, and Applications / 2503.16518 / ISBN:https://doi.org/10.48550/arXiv.2503.16518 / Published by ArXiv / Version released on 2025-05-06 / on (web) Publishing site


HH4AI: A methodological Framework for AI Human Rights impact assessment under the EUAI ACT / 2503.18994 / ISBN:https://doi.org/10.48550/arXiv.2503.18994 / Published by ArXiv / Version released on 2025-03-23 / on (web) Publishing site


Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents / 2504.01029 / ISBN:https://doi.org/10.48550/arXiv.2504.01029 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site


A Framework for Developing University Policies on Generative AI Governance: A Cross-national Comparative Study / 2504.02636 / ISBN:https://doi.org/10.48550/arXiv.2504.02636 / Published by ArXiv / Version released on 2025-11-18 / on (web) Publishing site


We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy / 2504.07936 / ISBN:https://doi.org/10.48550/arXiv.2504.07936 / Published by ArXiv / Version released on 2025-04-10 / on (web) Publishing site


Who is Responsible? The Data, Models, Users or Regulations? A Comprehensive Survey on Responsible Generative AI for a Sustainable Future / 2502.08650 / ISBN:https://doi.org/10.48550/arXiv.2502.08650 / Published by ArXiv / Version released on 2025-04-28 / on (web) Publishing site


Designing AI-Enabled Countermeasures to Cognitive Warfare / 2504.11486 / ISBN:https://doi.org/10.48550/arXiv.2504.11486 / Published by ArXiv / Version released on 2025-04-14 / on (web) Publishing site


Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions / 2504.15236 / ISBN:https://doi.org/10.48550/arXiv.2504.15236 / Published by ArXiv / Version released on 2025-04-21 / on (web) Publishing site


Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room / 2504.16148 / ISBN:https://doi.org/10.48550/arXiv.2504.16148 / Published by ArXiv / Version released on 2025-04-22 / on (web) Publishing site


Approaches to Responsible Governance of GenAI in Organizations / 2504.17044 / ISBN:https://doi.org/10.48550/arXiv.2504.17044 / Published by ArXiv / Version released on 2025-09-14 / on (web) Publishing site


AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to How / 2504.18044 / ISBN:https://doi.org/10.48550/arXiv.2504.18044 / Published by ArXiv / Version released on 2025-04-25 / on (web) Publishing site


Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation / 2504.21574 / ISBN:https://doi.org/10.48550/arXiv.2504.21574 / Published by ArXiv / Version released on 2025-04-30 / on (web) Publishing site


From Texts to Shields: Convergence of Large Language Models and Cybersecurity / 2505.00841 / ISBN:https://doi.org/10.48550/arXiv.2505.00841 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site


Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration / 2505.01514 / ISBN:https://doi.org/10.48550/arXiv.2505.01514 / Published by ArXiv / Version released on 2025-05-02 / on (web) Publishing site


SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use / 2505.17332 / ISBN:https://doi.org/10.48550/arXiv.2505.17332 / Published by ArXiv / Version released on 2025-05-22 / on (web) Publishing site


Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods / 2505.17870 / ISBN:https://doi.org/10.48550/arXiv.2505.17870 / Published by ArXiv / Version released on 2025-05-23 / on (web) Publishing site


Human-Centered Human-AI Collaboration (HCHAC) / 2505.22477 / ISBN:https://doi.org/10.48550/arXiv.2505.22477 / Published by ArXiv / Version released on 2025-05-28 / on (web) Publishing site


SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents / 2505.23559 / ISBN:https://doi.org/10.48550/arXiv.2505.23559 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site


Unintentional Consequences: Generative AI Use for Cybercrime / 2505.23733 / ISBN:https://doi.org/10.48550/arXiv.2505.23733 / Published by ArXiv / Version released on 2025-12-03 / on (web) Publishing site


DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models / 2506.01257 / ISBN:https://doi.org/10.48550/arXiv.2506.01257 / Published by ArXiv / Version released on 2025-06-02 / on (web) Publishing site


Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment / 2506.02046 / ISBN:https://doi.org/10.48550/arXiv.2506.02046 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site


Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation / 2506.02992 / ISBN:https://doi.org/10.48550/arXiv.2506.02992 / Published by ArXiv / Version released on 2025-06-03 / on (web) Publishing site


On the Ethics of Using LLMs for Offensive Security / 2506.08693 / ISBN:https://doi.org/10.48550/arXiv.2506.08693 / Published by ArXiv / Version released on 2025-06-10 / on (web) Publishing site


Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe? / 2506.11945 / ISBN:https://doi.org/10.48550/arXiv.2506.11945 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site


Reversing the Paradigm: Building AI-First Systems with Human Guidance / 2506.12245 / ISBN:https://doi.org/10.48550/arXiv.2506.12245 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site


Feeling Machines: Ethics, Culture, and the Rise of Emotional AI / 2506.12437 / ISBN:https://doi.org/10.48550/arXiv.2506.12437 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site


Regulating Next-Generation Implantable Brain-Computer Interfaces: Recommendations for Ethical Development and Implementation / 2506.12540 / ISBN:https://doi.org/10.48550/arXiv.2506.12540 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


Foundation of Affective Computing and Interaction / 2506.15497 / ISBN:https://doi.org/10.48550/arXiv.2506.15497 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site


The AI Policy Module: Developing Computer Science Student Competency in AI Ethics and Policy / 2506.15639 / ISBN:https://doi.org/10.48550/arXiv.2506.15639 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site


Adapting University Policies for Generative AI: Opportunities, Challenges, and Policy Solutions in Higher Education / 2506.22231 / ISBN:https://doi.org/10.48550/arXiv.2506.22231 / Published by ArXiv / Version released on 2025-06-27 / on (web) Publishing site


On the Surprising Efficacy of LLMs for Penetration-Testing / 2507.00829 / ISBN:https://doi.org/10.48550/arXiv.2507.00829 / Published by ArXiv / Version released on 2025-07-01 / on (web) Publishing site


AI Human Impact: Toward a Model for Ethical Investing in AI-Intensive Companies / 2507.07703 / ISBN:https://doi.org/10.48550/arXiv.2507.07703 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site


When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site


Artificial Intelligence Governance for Businesses / 2011.10672 / ISBN:https://doi.org/10.48550/arXiv.2011.10672 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust / 2506.07363 / ISBN:https://doi.org/10.48550/arXiv.2506.07363 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site


Policy-Driven AI in Dataspaces: Taxonomy, Explainability, and Pathways for Compliant Innovation / 2507.20014 / ISBN:https://doi.org/10.48550/arXiv.2507.20014 / Published by ArXiv / Version released on 2025-07-30 / on (web) Publishing site


Exploiting Jailbreaking Vulnerabilities in Generative AI to Bypass Ethical Safeguards for Facilitating Phishing Attacks / 2507.12185 / ISBN:https://doi.org/10.48550/arXiv.2507.12185 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


Redefining Elderly Care with Agentic AI: Challenges and Opportunities / 2507.14912 / ISBN:https://doi.org/10.48550/arXiv.2507.14912 / Published by ArXiv / Version released on 2025-07-20 / on (web) Publishing site


Challenges of Trustworthy Federated Learning: What's Done, Current Trends and Remaining Work / 2507.15796 / ISBN:https://doi.org/10.48550/arXiv.2507.15796 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


Generative AI as a Geopolitical Factor in Industry 5.0: Sovereignty, Access, and Control / 2508.00973 / ISBN:https://doi.org/10.48550/arXiv.2508.00973 / Published by ArXiv / Version released on 2025-08-01 / on (web) Publishing site


DIRF: A Framework for Digital Identity Protection and Clone Governance in Agentic AI Systems / 2508.01997 / ISBN:https://doi.org/10.48550/arXiv.2508.01997 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site


Think First, Verify Always: Training Humans to Face AI Risks / 2508.03714 / ISBN:https://doi.org/10.48550/arXiv.2508.03714 / Published by ArXiv / Version released on 2025-07-23 / on (web) Publishing site


A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


A Systematic Survey of Model Extraction Attacks and Defenses: State-of-the-Art and Perspectives / 2508.15031 / ISBN:https://doi.org/10.48550/arXiv.2508.15031 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


Can AI be Auditable? / 2509.00575 / ISBN:https://doi.org/10.48550/arXiv.2509.00575 / Published by ArXiv / Version released on 2025-09-14 / on (web) Publishing site


The Architecture of AI Transformation: Four Strategic Patterns and an Emerging Frontier / 2509.02853 / ISBN:https://doi.org/10.48550/arXiv.2509.02853 / Published by ArXiv / Version released on 2025-09-10 / on (web) Publishing site


Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants / 2508.12754 / ISBN:https://doi.org/10.48550/arXiv.2508.12754 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


CAI Fluency: A Framework for Cybersecurity AI Fluency / 2508.13588 / ISBN:https://doi.org/10.48550/arXiv.2508.13588 / Published by ArXiv / Version released on 2025-10-07 / on (web) Publishing site


The AI-Fraud Diamond: A Novel Lens for Auditing Algorithmic Deception / 2508.13984 / ISBN:https://doi.org/10.48550/arXiv.2508.13984 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site


The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site


Ethics of Artificial Intelligence / 2508.16658 / ISBN:https://doi.org/10.48550/arXiv.2508.16658 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site


A Study on the Framework for Evaluating the Ethics and Trustworthiness of Generative AI / 2509.00398 / ISBN:https://doi.org/10.48550/arXiv.2509.00398 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


A Maslow-Inspired Hierarchy of Engagement with AI Model / 2509.07032 / ISBN:https://doi.org/10.48550/arXiv.2509.07032 / Published by ArXiv / Version released on 2025-09-07 / on (web) Publishing site


AI and the Future of Academic Peer Review / 2509.14189 / ISBN:https://doi.org/10.48550/arXiv.2509.14189 / Published by ArXiv / Version released on 2026-02-27 / on (web) Publishing site


A five-layer framework for AI governance: integrating regulation, standards, and certification / 2509.11332 / ISBN:https://doi.org/10.48550/arXiv.2509.11332 / Published by ArXiv / Version released on 2025-09-14 / on (web) Publishing site


Digital Sovereignty Control Framework for Military AI-based Cyber Security / 2509.13072 / ISBN:https://doi.org/10.48550/arXiv.2509.13072 / Published by ArXiv / Version released on 2025-09-16 / on (web) Publishing site


Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site


Fully Autonomous AI Agents Should Not be Developed / 2502.02649 / ISBN:https://doi.org/10.48550/arXiv.2502.02649 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site


The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs / 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles / 2510.21293 / ISBN:https://doi.org/10.48550/arXiv.2510.21293 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site


Argumentation-Based Explainability for Legal AI: Comparative and Regulatory Perspectives / 2510.11079 / ISBN:https://doi.org/10.48550/arXiv.2510.11079 / Published by ArXiv / Version released on 2025-10-13 / on (web) Publishing site


AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


Trust in foundation models and GenAI: A geographic perspective / 2510.17942 / ISBN:https://doi.org/10.48550/arXiv.2510.17942 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site


How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site


Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning / 2511.07682 / ISBN:https://doi.org/10.48550/arXiv.2511.07682 / Published by ArXiv / Version released on 2025-11-10 / on (web) Publishing site


Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site


A Lexical Analysis of online Reviews on Human-AI Interactions / 2511.13480 / ISBN:https://doi.org/10.48550/arXiv.2511.13480 / Published by ArXiv / Version released on 2025-11-17 / on (web) Publishing site


Hybrid Neuro-Symbolic Models for Ethical AI in Risk-Sensitive Domains / 2511.17644 / ISBN:https://doi.org/10.48550/arXiv.2511.17644 / Published by ArXiv / Version released on 2025-11-20 / on (web) Publishing site


The Workflow as Medium: A Framework for Navigating Human-AI Co-Creation / 2511.18182 / ISBN:https://doi.org/10.48550/arXiv.2511.18182 / Published by ArXiv / Version released on 2025-11-22 / on (web) Publishing site


Who Owns the Knowledge? Copyright, GenAI, and the Future of Academic Publishing / 2511.21755 / ISBN:https://doi.org/10.48550/arXiv.2511.21755 / Published by ArXiv / Version released on 2026-01-18 / on (web) Publishing site


The Decision Path to Control AI Risks Completely: Fundamental Control Mechanisms for AI Governance / 2512.04489 / ISBN:https://doi.org/10.48550/arXiv.2512.04489 / Published by ArXiv / Version released on 2025-12-24 / on (web) Publishing site


A Framework for Responsible AI Systems: Building Societal Trust through Domain Definition, Trustworthy AI Design, Auditability, Accountability, and Governance / 2503.04739 / ISBN:https://doi.org/10.48550/arXiv.2503.04739 / Published by ArXiv / Version released on 2026-01-08 / on (web) Publishing site


PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI / 2512.24848 / ISBN:https://doi.org/10.48550/arXiv.2512.24848 / Published by ArXiv / Version released on 2025-12-31 / on (web) Publishing site


Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site


Epistemic Constitutionalism Or: how to avoid coherence bias / 2601.14295 / ISBN:https://doi.org/10.48550/arXiv.2601.14295 / Published by ArXiv / Version released on 2026-04-22 / on (web) Publishing site


AI Agents vs. Human Investigators: Balancing Automation, Security, and Expertise in Cyber Forensic Analysis / 2601.14544 / ISBN:https://doi.org/10.48550/arXiv.2601.14544 / Published by ArXiv / Version released on 2026-01-20 / on (web) Publishing site


Human Society-Inspired Approaches to Agentic AI Security: The 4C Framework / 2602.01942 / ISBN:https://doi.org/10.48550/arXiv.2602.01942 / Published by ArXiv / Version released on 2026-02-02 / on (web) Publishing site


A Human-Centered Privacy Approach (HCP) to AI / 2602.04616 / ISBN:https://doi.org/10.48550/arXiv.2602.04616 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site


Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site


Insidious Imaginaries: A Critical Overview of AI Speculations / 2602.17383 / ISBN:https://doi.org/10.34626/2025_xcoax_001 / Version released on 2026-02-19 / on (web) Publishing site


Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models / 2602.19124 / ISBN:https://doi.org/10.48550/arXiv.2602.19124 / Published by ArXiv / Version released on 2026-02-22 / on (web) Publishing site


Can LLMs Synthesize Court-Ready Statistical Evidence? Evaluating AI-Assisted Sentencing Bias Analysis for California Racial Justice Act Claims / 2603.04804 / ISBN:https://doi.org/10.48550/arXiv.2603.04804 / Published by ArXiv / Version released on 2026-03-05 / on (web) Publishing site


Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Published by ArXiv / Version released on 2026-03-23 / on (web) Publishing site


The Landscape of Generative AI in Information Systems: A Synthesis of Secondary Reviews and Research Agendas / 2603.11842 / ISBN:https://doi.org/10.48550/arXiv.2603.11842 / Published by ArXiv / Version released on 2026-03-12 / on (web) Publishing site


AI Integrity: A New Paradigm for Verifiable AI Governance / 2604.11065 / ISBN:https://doi.org/10.48550/arXiv.2604.11065 / Published by ArXiv / Version released on 2026-04-13 / on (web) Publishing site


Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure / 2605.00055 / ISBN:https://doi.org/10.48550/arXiv.2605.00055 / Published by ArXiv / Version released on 2026-04-29 / on (web) Publishing site


Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Published by ArXiv / Version released on 2026-05-07 / on (web) Publishing site