if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long
if you modify the keywords, press enter within the field to confirm the new search key
Tag: henderson
Bibliography items where occurs: 49
- The AI Index 2022 Annual Report / 2205.03468 / ISBN:https://doi.org/10.48550/arXiv.2205.03468 / Published by ArXiv / Version released on 2022-05-02 / on (web) Publishing site
- The Ethics of AI Value Chains / 2307.16787 / ISBN:https://doi.org/10.48550/arXiv.2307.16787 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site
- The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site
- The Cambridge Law Corpus: A Corpus for Legal AI Research / 2309.12269 / ISBN:https://doi.org/10.48550/arXiv.2309.12269 / Published by ArXiv / Version released on 2024-01-01 / on (web) Publishing site
- Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site
- Intelligence Primer / 2008.07324 / ISBN:https://doi.org/10.48550/arXiv.2008.07324 / Published by ArXiv / Version released on 2025-09-03 / on (web) Publishing site
- Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's CubeĆ / 2402.01760 / ISBN:https://doi.org/10.48550/arXiv.2402.01760 / Published by ArXiv / Version released on 2024-08-27 / on (web) Publishing site
- The Journey to Trustworthy AI- Part 1 Pursuit of Pragmatic Frameworks / 2403.15457 / ISBN:https://doi.org/10.48550/arXiv.2403.15457 / Published by ArXiv / Version released on 2024-04-06 / on (web) Publishing site
- The Narrow Depth and Breadth of Corporate Responsible AI Research / 2405.12193 / ISBN:https://doi.org/10.48550/arXiv.2405.12193 / Published by ArXiv / Version released on 2026-01-28 / on (web) Publishing site
- Bridging the Global Divide in AI Regulation: A Proposal for a Contextual, Coherent, and Commensurable Framework / 2303.11196 / ISBN:https://doi.org/10.48550/arXiv.2303.11196 / Published by ArXiv / Version released on 2024-07-15 / on (web) Publishing site
- Between Copyright and Computer Science: The Law and Ethics of Generative AI / 2403.14653 / ISBN:https://doi.org/10.48550/arXiv.2403.14653 / Published by ArXiv / Version released on 2024-09-05 / on (web) Publishing site
- The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site
- Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
/ 2409.16001 / ISBN:https://doi.org/10.48550/arXiv.2409.16001 / Published by ArXiv / Version released on 2025-12-05 / on (web) Publishing site
- Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site
- Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site
- Human-Centered AI Transformation: Exploring Behavioral Dynamics in Software Engineering / 2411.08693 / ISBN:https://doi.org/10.48550/arXiv.2411.08693 / Published by ArXiv / Version released on 2024-11-13 / on (web) Publishing site
- CERN for AI: A Theoretical Framework for Autonomous Simulation-Based Artificial Intelligence Testing and Alignment / 2312.09402 / ISBN:https://doi.org/10.48550/arXiv.2312.09402 / Published by ArXiv / Version released on 2025-01-06 / on (web) Publishing site
- Clio: Privacy-Preserving Insights into Real-World AI Use / 2412.13678 / ISBN:https://doi.org/10.48550/arXiv.2412.13678 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site
- Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site
- Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site
- On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives
/ 2502.16841 / ISBN:https://doi.org/10.48550/arXiv.2502.16841 / Published by ArXiv / Version released on 2026-01-14 / on (web) Publishing site
- Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents / 2504.01029 / ISBN:https://doi.org/10.48550/arXiv.2504.01029 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site
- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site
- When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site
- Artificial Intelligence Governance for Businesses / 2011.10672 / ISBN:https://doi.org/10.48550/arXiv.2011.10672 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation / 2507.15901 / ISBN:https://doi.org/10.48550/arXiv.2507.15901 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site
- The Silicon Reasonable Person: Can AI Predict How Ordinary People Judge Reasonableness? / 2508.02766 / ISBN:https://doi.org/10.48550/arXiv.2508.02766 / Published by ArXiv / Version released on 2025-08-04 / on (web) Publishing site
- Data and AI governance: Promoting equity, ethics, and fairness in large language models / 2508.03970 / ISBN:https://doi.org/10.48550/arXiv.2508.03970 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site
- The AI-Fraud Diamond: A Novel Lens for Auditing Algorithmic Deception / 2508.13984 / ISBN:https://doi.org/10.48550/arXiv.2508.13984 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site
- AI as IA: The use and abuse of artificial intelligence (AI) for human enhancement through intellectual augmentation (IA) / 2508.16642 / ISBN:https://doi.org/10.48550/arXiv.2508.16642 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site
- Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site
- An Analysis of the New EU AI Act and A Proposed Standardization Framework for Machine Learning Fairness / 2510.01281 / ISBN:https://doi.org/10.48550/arXiv.2510.01281 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs
/ 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site
- Making Power Explicable in AI: Analyzing, Understanding, and Redirecting Power to Operationalize Ethics in AI Technical Practice / 2510.10588 / ISBN:https://doi.org/10.48550/arXiv.2510.10588 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site
- How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site
- A Human-centric Framework for Debating the Ethics of AI Consciousness Under Uncertainty
/ 2512.02544 / ISBN:https://doi.org/10.48550/arXiv.2512.02544 / Published by ArXiv / Version released on 2025-12-02 / on (web) Publishing site
- Evaluation of AI Ethics Tools in Language Models: A Developers' Perspective Case Stud / 2512.15791 / ISBN:https://doi.org/10.48550/arXiv.2512.15791 / Published by ArXiv / Version released on 2025-12-16 / on (web) Publishing site
- Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site
- Research Integrity and Academic Authority in the Age of Artificial Intelligence: From Discovery to Curation? / 2601.05574 / ISBN:https://doi.org/10.48550/arXiv.2601.05574 / Published by ArXiv / Version released on 2026-01-09 / on (web) Publishing site
- Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site
- Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models / 2602.19124 / ISBN:https://doi.org/10.48550/arXiv.2602.19124 / Version released on 2026-02-22 / on (web) Publishing site
- Building the ethical AI framework of the future: from philosophy to practice
/ 2603.06599 / ISBN:https://doi.org/10.48550/arXiv.2603.06599 / Version released on 2026-02-16 / on (web) Publishing site
- Bridging the Gap in the Responsible AI Divides
/ 2603.14495 / ISBN:https://doi.org/10.48550/arXiv.2603.14495 / Version released on 2026-03-15 / on (web) Publishing site
- Ethics Testing: Proactive Identification of Generative AI System Harms
/ 2604.22089 / ISBN:https://doi.org/10.48550/arXiv.2604.22089 / Version released on 2026-04-23 / on (web) Publishing site
- Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Version released on 2026-05-07 / on (web) Publishing site
_