if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long
if you modify the keywords, press enter within the field to confirm the new search key
Tag: probes
Bibliography items where occurs: 32
- The AI Index 2022 Annual Report / 2205.03468 / ISBN:https://doi.org/10.48550/arXiv.2205.03468 / Published by ArXiv / Version released on 2022-05-02 / on (web) Publishing site
- ESR: Ethics and Society Review of Artificial Intelligence Research / 2106.11521 / ISBN:https://doi.org/10.48550/arXiv.2106.11521 / Published by ArXiv / Version released on 2021-07-09 / on (web) Publishing site
- Bad, mad, and cooked: Moral responsibility for civilian harms in human-AI military teams / 2211.06326 / ISBN:https://doi.org/10.48550/arXiv.2211.06326 / Published by ArXiv / Version released on 2023-09-06 / on (web) Publishing site
- The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site
- Enabling Global Image Data Sharing in the Life Sciences / 2401.13023 / ISBN:https://doi.org/10.48550/arXiv.2401.13023 / Published by ArXiv / Version released on 2024-02-02 / on (web) Publishing site
- Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models / 2406.05602 / Published by ArXiv / Version released on 2024-06-09 / on (web) Publishing site
- Exploring the Role of Social Support when Integrating Generative AI into Small Business Workflows / 2407.21404 / ISBN:https://doi.org/10.48550/arXiv.2407.21404 / Published by ArXiv / Version released on 2024-07-31 / on (web) Publishing site
- Data-Centric Foundation Models in Computational Healthcare: A Survey / 2401.02458 / ISBN:https://doi.org/10.48550/arXiv.2401.02458 / Published by ArXiv / Version released on 2024-10-07 / on (web) Publishing site
- Uncovering Bias in Foundation Models: Impact, Testing, Harm, and Mitigation / 2501.10453 / ISBN:https://doi.org/10.48550/arXiv.2501.10453 / Published by ArXiv / Version released on 2025-01-14 / on (web) Publishing site
- Transforming Cyber Defense: Harnessing Agentic and Frontier AI for Proactive, Ethical Threat Intelligence / 2503.00164 / ISBN:https://doi.org/10.48550/arXiv.2503.00164 / Published by ArXiv / Version released on 2025-02-28 / on (web) Publishing site
- Generative AI and News Consumption: Design Fictions and Critical Analysis / 2503.20391 / ISBN:https://doi.org/10.48550/arXiv.2503.20391 / Published by ArXiv / Version released on 2025-03-26 / on (web) Publishing site
- Who Owns the Output? Bridging Law and Technology in LLMs Attribution / 2504.01032 / ISBN:https://doi.org/10.48550/arXiv.2504.01032 / Published by ArXiv / Version released on 2025-03-29 / on (web) Publishing site
- Auditing the Ethical Logic of Generative AI Models / 2504.17544 / ISBN:https://doi.org/10.48550/arXiv.2504.17544 / Published by ArXiv / Version released on 2025-04-24 / on (web) Publishing site
- Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data / 2505.09974 / ISBN:https://doi.org/10.48550/arXiv.2505.09974 / Published by ArXiv / Version released on 2025-05-15 / on (web) Publishing site
- Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods / 2505.17870 / ISBN:https://doi.org/10.48550/arXiv.2505.17870 / Published by ArXiv / Version released on 2025-05-23 / on (web) Publishing site
- Making Sense of the Unsensible: Reflection, Survey, and Challenges for XAI in Large Language Models Toward Human-Centered AI / 2505.20305 / ISBN:https://doi.org/10.48550/arXiv.2505.20305 / Published by ArXiv / Version released on 2025-05-18 / on (web) Publishing site
- Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site
- Making the Right Thing: Bridging HCI and Responsible AI in Early-Stage AI Concept Selection / 2506.17494 / ISBN:https://doi.org/10.48550/arXiv.2506.17494 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site
- AI Through the Human Lens: Investigating Cognitive Theories in Machine Psychology
/ 2506.18156 / ISBN:https://doi.org/10.48550/arXiv.2506.18156 / Published by ArXiv / Version released on 2025-11-07 / on (web) Publishing site
- Mechanistic Interpretability Needs Philosophy / 2506.18852 / ISBN:https://doi.org/10.48550/arXiv.2506.18852 / Published by ArXiv / Version released on 2025-06-23 / on (web) Publishing site
- Ethics by Design: A Lifecycle Framework for Trustworthy AI in Medical Imaging From Transparent Data Governance to Clinically Validated Deployment / 2507.04249 / ISBN:https://doi.org/10.48550/arXiv.2507.04249 / Published by ArXiv / Version released on 2025-07-06 / on (web) Publishing site
- The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site
- Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- An Intelligent Infrastructure as a Foundation for Modern Science / 2508.10051 / ISBN:https://doi.org/10.48550/arXiv.2508.10051 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- When AI Writes Back: Ethical Considerations by Physicians on AI-Drafted Patient Message Replies / 2508.13217 / ISBN:https://doi.org/10.48550/arXiv.2508.13217 / Published by ArXiv / Version released on 2025-08-17 / on (web) Publishing site
- Explainability of Algorithms / 2508.13529 / ISBN:https://doi.org/10.1002/9781394240821.ch11 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site
- The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site
- The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? / 2506.01813 / ISBN:https://doi.org/10.48550/arXiv.2506.01813 / Published by ArXiv / Version released on 2025-09-29 / on (web) Publishing site
- Making Power Explicable in AI: Analyzing, Understanding, and Redirecting Power to Operationalize Ethics in AI Technical Practice / 2510.10588 / ISBN:https://doi.org/10.48550/arXiv.2510.10588 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site
- How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2025-10-27 / on (web) Publishing site
- When Machines Join the Moral Circle: The Persona Effect of Generative AI Agents in Collaborative Reasoning / 2511.01205 / ISBN:https://doi.org/10.48550/arXiv.2511.01205 / Published by ArXiv / Version released on 2025-11-03 / on (web) Publishing site
_