if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long
if you modify the keywords, press enter within the field to confirm the new search key
Tag: dangerous
Bibliography items where occurs: 161
- Exciting, Useful, Worrying, Futuristic:
Public Perception of Artificial Intelligence in 8 Countries / 2001.00081 / ISBN:https://doi.org/10.48550/arXiv.2001.00081 / Published by ArXiv / Version released on 2021-05-18 / on (web) Publishing site
- A Framework for Ethical AI at the United Nations / 2104.12547 / ISBN:https://doi.org/10.48550/arXiv.2104.12547 / Published by ArXiv / Version released on 2021-04-09 / on (web) Publishing site
- On the Current and Emerging Challenges of Developing Fair and Ethical AI Solutions in Financial Services / 2111.01306 / ISBN:https://doi.org/10.48550/arXiv.2111.01306 / Published by ArXiv / Version released on 2021-11-02 / on (web) Publishing site
- From Military to Healthcare: Adopting and Expanding Ethical Principles for Generative Artificial Intelligence / 2308.02448 / ISBN:https://doi.org/10.48550/arXiv.2308.02448 / Published by ArXiv / Version released on 2023-08-04 / on (web) Publishing site
- Ethical Considerations and Policy Implications for Large Language Models: Guiding Responsible Development and Deployment / 2308.02678 / ISBN:https://doi.org/10.48550/arXiv.2308.02678 / Published by ArXiv / Version released on 2023-08-01 / on (web) Publishing site
- Dual Governance: The intersection of centralized regulation and crowdsourced safety mechanisms for Generative AI / 2308.04448 / ISBN:https://doi.org/10.48550/arXiv.2308.04448 / Published by ArXiv / Version released on 2023-08-02 / on (web) Publishing site
- The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site
- The AI Incident Database as an Educational Tool to Raise Awareness of AI Harms: A Classroom Exploration of Efficacy, Limitations, & Future Improvements / 2310.06269 / ISBN:https://doi.org/10.48550/arXiv.2310.06269 / Published by ArXiv / Version released on 2023-10-10 / on (web) Publishing site
- A Review of the Ethics of Artificial Intelligence and its Applications in the United States / 2310.05751 / ISBN:https://doi.org/10.48550/arXiv.2310.05751 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site
- Ethics of Artificial Intelligence and Robotics in the Architecture, Engineering, and Construction Industry / 2310.05414 / ISBN:https://doi.org/10.48550/arXiv.2310.05414 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site
- Towards A Unified Utilitarian Ethics Framework for Healthcare Artificial Intelligence / 2309.14617 / ISBN:https://doi.org/10.48550/arXiv.2309.14617 / Published by ArXiv / Version released on 2023-09-26 / on (web) Publishing site
- In Consideration of Indigenous Data Sovereignty: Data Mining as a Colonial Practice / 2309.10215 / ISBN:https://doi.org/10.48550/arXiv.2309.10215 / Published by ArXiv / Version released on 2023-09-19 / on (web) Publishing site
- Specific versus General Principles for Constitutional AI / 2310.13798 / ISBN:https://doi.org/10.48550/arXiv.2310.13798 / Published by ArXiv / Version released on 2023-10-20 / on (web) Publishing site
- Human participants in AI research: Ethics and transparency in practice / 2311.01254 / ISBN:https://doi.org/10.48550/arXiv.2311.01254 / Published by ArXiv / Version released on 2024-09-26 / on (web) Publishing site
- The Rise of Creative Machines: Exploring the Impact of Generative AI / 2311.13262 / ISBN:https://doi.org/10.48550/arXiv.2311.13262 / Published by ArXiv / Version released on 2023-11-22 / on (web) Publishing site
- Deepfakes, Misinformation, and Disinformation in the Era of Frontier AI, Generative AI, and Large AI Models / 2311.17394 / ISBN:https://doi.org/10.48550/arXiv.2311.17394 / Published by ArXiv / Version released on 2023-11-29 / on (web) Publishing site
- From Lab to Field: Real-World Evaluation of an AI-Driven Smart Video Solution to Enhance Community Safety / 2312.02078 / ISBN:https://doi.org/10.48550/arXiv.2312.02078 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- Intelligence Primer / 2008.07324 / ISBN:https://doi.org/10.48550/arXiv.2008.07324 / Published by ArXiv / Version released on 2025-09-03 / on (web) Publishing site
- Control Risk for Potential Misuse of Artificial Intelligence in Science / 2312.06632 / ISBN:https://doi.org/10.48550/arXiv.2312.06632 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site
- Designing Guiding Principles for NLP for Healthcare: A Case Study of Maternal Health / 2312.11803 / ISBN:https://doi.org/10.48550/arXiv.2312.11803 / Published by ArXiv / Version released on 2023-12-19 / on (web) Publishing site
- AI Ethics Principles in Practice: Perspectives of Designers and Developers / 2112.07467 / ISBN:https://doi.org/10.48550/arXiv.2112.07467 / Published by ArXiv / Version released on 2024-09-06 / on (web) Publishing site
- Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's CubeĆ / 2402.01760 / ISBN:https://doi.org/10.48550/arXiv.2402.01760 / Published by ArXiv / Version released on 2024-08-27 / on (web) Publishing site
- Commercial AI, Conflict, and Moral Responsibility: A theoretical analysis and practical approach to the moral responsibilities associated with dual-use AI technology / 2402.01762 / ISBN:https://doi.org/10.48550/arXiv.2402.01762 / Published by ArXiv / Version released on 2024-01-30 / on (web) Publishing site
- Mapping the Ethics of Generative AI: A Comprehensive Scoping Review / 2402.08323 / ISBN:https://doi.org/10.48550/arXiv.2402.08323 / Published by ArXiv / Version released on 2024-02-13 / on (web) Publishing site
- Towards an AI-Enhanced Cyber Threat Intelligence Processing Pipeline / 2403.03265 / ISBN:https://doi.org/10.48550/arXiv.2403.03265 / Published by ArXiv / Version released on 2024-03-05 / on (web) Publishing site
- A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site
- Review of Generative AI Methods in Cybersecurity / 2403.08701 / ISBN:https://doi.org/10.48550/arXiv.2403.08701 / Published by ArXiv / Version released on 2024-03-19 / on (web) Publishing site
- Trust in AI: Progress, Challenges, and Future Directions / 2403.14680 / ISBN:https://doi.org/10.48550/arXiv.2403.14680 / Published by ArXiv / Version released on 2024-04-04 / on (web) Publishing site
- AI Act and Large Language Models (LLMs): When critical issues and privacy impact require human and ethical oversight / 2404.00600 / ISBN:https://doi.org/10.48550/arXiv.2404.00600 / Published by ArXiv / Version released on 2024-04-02 / on (web) Publishing site
- Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems / 2404.03995 / ISBN:https://doi.org/10.48550/arXiv.2404.03995 / Published by ArXiv / Version released on 2024-04-05 / on (web) Publishing site
- Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents / 2404.06750 / ISBN:https://arxiv.org/abs/2404.06750 / Published by ArXiv / Version released on 2024-10-18 / on (web) Publishing site
- AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site
- Epistemic Power in AI Ethics Labor: Legitimizing Located Complaints / 2402.08171 / ISBN:https://doi.org/10.1145/3630106.3658973 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site
- Taxonomy to Regulation: A (Geo)Political Taxonomy for AI Risks and Regulatory Measures in the EU AI Act / 2404.11476 / ISBN:https://doi.org/10.48550/arXiv.2404.11476 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site
- The Necessity of AI Audit Standards Boards / 2404.13060 / ISBN:https://doi.org/10.48550/arXiv.2404.13060 / Published by ArXiv / Version released on 2024-04-11 / on (web) Publishing site
- War Elephants: Rethinking Combat AI and Human Oversight / 2404.19573 / ISBN:https://doi.org/10.48550/arXiv.2404.19573 / Published by ArXiv / Version released on 2024-04-30 / on (web) Publishing site
- AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research / 2405.01859 / ISBN:https://doi.org/10.48550/arXiv.2405.01859 / Published by ArXiv / Version released on 2024-05-31 / on (web) Publishing site
- Not My Voice! A Taxonomy of Ethical and Safety Harms of Speech Generators / 2402.01708 / ISBN:https://doi.org/10.48550/arXiv.2402.01708 / Published by ArXiv / Version released on 2024-05-15 / on (web) Publishing site
- The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative / 2402.14859 / ISBN:https://doi.org/10.48550/arXiv.2402.14859 / Published by ArXiv / Version released on 2024-06-03 / on (web) Publishing site
- Cyber Risks of Machine Translation Critical Errors : Arabic Mental Health Tweets as a Case Study / 2405.11668 / ISBN:https://doi.org/10.48550/arXiv.2405.11668 / Published by ArXiv / Version released on 2024-05-19 / on (web) Publishing site
- A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions / 2405.14487 / ISBN:https://doi.org/10.48550/arXiv.2405.14487 / Published by ArXiv / Version released on 2024-05-23 / on (web) Publishing site
- The ethical situation of DALL-E 2
/ 2405.19176 / ISBN:https://doi.org/10.48550/arXiv.2405.19176 / Published by ArXiv / Version released on 2024-05-29 / on (web) Publishing site
- The Future of Child Development in the AI Era. Cross-Disciplinary Perspectives Between AI and Child Development Experts / 2405.19275 / ISBN:https://doi.org/10.48550/arXiv.2405.19275 / Published by ArXiv / Version released on 2024-05-29 / on (web) Publishing site
- Gender Bias Detection in Court Decisions: A Brazilian Case Study / 2406.00393 / ISBN:https://doi.org/10.48550/arXiv.2406.00393 / Published by ArXiv / Version released on 2024-06-01 / on (web) Publishing site
- How Ethical Should AI Be? How AI Alignment Shapes the Risk Preferences of LLMs / 2406.01168 / ISBN:https://doi.org/10.48550/arXiv.2406.01168 / Published by ArXiv / Version released on 2024-08-01 / on (web) Publishing site
- Current state of LLM Risks and AI Guardrails / 2406.12934 / ISBN:https://doi.org/10.48550/arXiv.2406.12934 / Published by ArXiv / Version released on 2024-06-16 / on (web) Publishing site
- Documenting Ethical Considerations in Open Source AI Models / 2406.18071 / ISBN:https://doi.org/10.48550/arXiv.2406.18071 / Published by ArXiv / Version released on 2024-07-03 / on (web) Publishing site
- AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations / 2406.18346 / ISBN:https://doi.org/10.48550/arXiv.2406.18346 / Published by ArXiv / Version released on 2024-06-26 / on (web) Publishing site
- Staying vigilant in the Age of AI: From content generation to content authentication / 2407.00922 / ISBN:https://doi.org/10.48550/arXiv.2407.00922 / Published by ArXiv / Version released on 2024-07-01 / on (web) Publishing site
- Bridging the Global Divide in AI Regulation: A Proposal for a Contextual, Coherent, and Commensurable Framework / 2303.11196 / ISBN:https://doi.org/10.48550/arXiv.2303.11196 / Published by ArXiv / Version released on 2024-07-15 / on (web) Publishing site
- Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias / 2407.11360 / ISBN:https://doi.org/10.48550/arXiv.2407.11360 / Published by ArXiv / Version released on 2024-07.16 / on (web) Publishing site
- Prioritizing High-Consequence Biological Capabilities in Evaluations of Artificial Intelligence Models / 2407.13059 / ISBN:https://doi.org/10.48550/arXiv.2407.13059 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site
- Assurance of AI Systems From a Dependability Perspective / 2407.13948 / ISBN:https://doi.org/10.48550/arXiv.2407.13948 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site
- RogueGPT: dis-ethical tuning transforms ChatGPT4 into a Rogue AI in 158 Words / 2407.15009 / ISBN:https://doi.org/10.48550/arXiv.2407.15009 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site
- Nudging Using Autonomous Agents: Risks and Ethical Considerations / 2407.16362 / ISBN:https://doi.org/10.48550/arXiv.2407.16362 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site
- Mapping the individual, social, and biospheric impacts of Foundation Models / 2407.17129 / ISBN:https://doi.org/10.48550/arXiv.2407.17129 / Published by ArXiv / Version released on 2024-07-24 / on (web) Publishing site
- Speculations on Uncertainty and Humane Algorithms / 2408.06736 / ISBN:https://doi.org/10.48550/arXiv.2408.06736 / Published by ArXiv / Version released on 2024-08-13 / on (web) Publishing site
- Don't Kill the Baby: The Case for AI in Arbitration / 2408.11608 / ISBN:https://doi.org/10.48550/arXiv.2408.11608 / Published by ArXiv / Version released on 2025-03-22 / on (web) Publishing site
- CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher / 2408.11650 / ISBN:https://doi.org/10.48550/arXiv.2408.11650 / Published by ArXiv / Version released on 2024-11-06 / on (web) Publishing site
- What Is Required for Empathic AI? It Depends, and Why That Matters for AI Developers and Users / 2408.15354 / ISBN:https://doi.org/10.48550/arXiv.2408.15354 / Published by ArXiv / Version released on 2024-08-27 / on (web) Publishing site
- Digital Homunculi: Reimagining Democracy Research with Generative Agents / 2409.00826 / ISBN:https://doi.org/10.48550/arXiv.2409.00826 / Published by ArXiv / Version released on 2024-09-01 / on (web) Publishing site
- Synthetic Human Memories: AI-Edited Images and Videos Can Implant False Memories and Distort Recollection / 2409.08895 / ISBN:https://doi.org/10.48550/arXiv.2409.08895 / Published by ArXiv / Version released on 2024-09-13 / on (web) Publishing site
- Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure / 2409.19104 / ISBN:https://doi.org/10.48550/arXiv.2409.19104 / Published by ArXiv / Version released on 2024-09-27 / on (web) Publishing site
- Ethical software requirements from user reviews: A systematic literature review / 2410.01833 / ISBN:https://doi.org/10.48550/arXiv.2410.01833 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site
- From human-centered to social-centered artificial intelligence: Assessing ChatGPT's impact through disruptive events / 2306.00227 / ISBN:https://doi.org/10.48550/arXiv.2306.00227 / Published by ArXiv / Version released on 2024-10-25 / on (web) Publishing site
- Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models
/ 2410.12880 / ISBN:https://doi.org/10.48550/arXiv.2410.12880 / Published by ArXiv / Version released on 2025-01-24 / on (web) Publishing site
- Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site
- Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site
- The Dark Side of AI Companionship: A Taxonomy of Harmful Algorithmic Behaviors in Human-AI Relationships / 2410.20130 / ISBN:https://doi.org/10.48550/arXiv.2410.20130 / Published by ArXiv / Version released on 2025-01-26 / on (web) Publishing site
- Standardization Trends on Safety and Trustworthiness Technology for Advanced AI / 2410.22151 / ISBN:https://doi.org/10.48550/arXiv.2410.22151 / Published by ArXiv / Version released on 2024-10-29 / on (web) Publishing site
- A Comprehensive Review of Multimodal XR Applications, Risks, and Ethical Challenges in the Metaverse / 2411.04508 / ISBN:https://doi.org/10.48550/arXiv.2411.04508 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site
- Persuasion with Large Language Models: a Survey / 2411.06837 / ISBN:https://doi.org/10.48550/arXiv.2411.06837 / Published by ArXiv / Version released on 2024-11-11 / on (web) Publishing site
- Good intentions, unintended consequences: exploring forecasting harms
/ 2411.16531 / ISBN:https://doi.org/10.48550/arXiv.2411.16531 / Published by ArXiv / Version released on 2025-03-12 / on (web) Publishing site
- Ethics and Artificial Intelligence Adoption / 2412.00330 / ISBN:https://doi.org/10.48550/arXiv.2412.00330 / Published by ArXiv / Version released on 2024-11-30 / on (web) Publishing site
- Towards a Practical Ethics of Generative AI in Creative Production Processes / 2412.03579 / ISBN:https://doi.org/10.48550/arXiv.2412.03579 / Published by ArXiv / Version released on 2024-11-18 / on (web) Publishing site
- Bots against Bias: Critical Next Steps for Human-Robot Interaction / 2412.12542 / ISBN:https://doi.org/10.1017/9781009386708.023 / Published by ArXiv / Version released on 2024-12-17 / on (web) Publishing site
- Autonomous Vehicle Security: A Deep Dive into Threat Modeling / 2412.15348 / ISBN:https://doi.org/10.48550/arXiv.2412.15348 / Published by ArXiv / Version released on 2024-12-19 / on (web) Publishing site
- Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site
- Addressing Intersectionality, Explainability, and Ethics in AI-Driven Diagnostics: A Rebuttal and Call for Transdiciplinary Action / 2501.08497 / ISBN:https://doi.org/10.48550/arXiv.2501.08497 / Published by ArXiv / Version released on 2025-01-15 / on (web) Publishing site
- Towards A Litmus Test for Common Sense / 2501.09913 / ISBN:https://doi.org/10.48550/arXiv.2501.09913 / Published by ArXiv / Version released on 2025-01-17 / on (web) Publishing site
- A Critical Field Guide for Working with Machine Learning Datasets / 2501.15491 / ISBN:https://doi.org/10.48550/arXiv.2501.15491 / Published by ArXiv / Version released on 2025-01-26 / on (web) Publishing site
- A Case Study in Acceleration AI Ethics: The TELUS GenAI Conversational Agent
/ 2501.18038 / ISBN:https://doi.org/10.48550/arXiv.2501.18038 / Published by ArXiv / Version released on 2025-03-26 / on (web) Publishing site
- Towards Safe AI Clinicians: A Comprehensive Study on Large Language Model Jailbreaking in Healthcare / 2501.18632 / ISBN:https://doi.org/10.48550/arXiv.2501.18632 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site
- Agentic AI: Expanding the Algorithmic Frontier of Creative Problem Solving / 2502.00289 / ISBN:https://doi.org/10.48550/arXiv.2502.00289 / Published by ArXiv / Version released on 2025-02-01 / on (web) Publishing site
- Agentic AI for Scaling Diagnosis and Care in Neurodegenerative Disease / 2502.06842 / ISBN:https://doi.org/10.48550/arXiv.2502.06842 / Published by ArXiv / Version released on 2025-12-23 / on (web) Publishing site
- Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site
- On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- Decoding the Black Box: Integrating Moral Imagination with Technical AI Governance / 2503.06411 / ISBN:https://doi.org/10.48550/arXiv.2503.06411 / Published by ArXiv / Version released on 2025-03-09 / on (web) Publishing site
- MinorBench: A hand-built benchmark for content-based risks for children / 2503.10242 / ISBN:https://doi.org/10.48550/arXiv.2503.10242 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site
- DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site
- A Peek Behind the Curtain: Using Step-Around Prompt Engineering to Identify Bias and Misinformation in GenAI Models / 2503.15205 / ISBN:https://doi.org/10.48550/arXiv.2503.15205 / Published by ArXiv / Version released on 2026-01-22 / on (web) Publishing site
- AI Family Integration Index (AFII): Benchmarking a New Global Readiness for AI as Family / 2503.22772 / ISBN:https://doi.org/10.48550/arXiv.2503.22772 / Published by ArXiv / Version released on 2025-03-28 / on (web) Publishing site
- Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents / 2504.01029 / ISBN:https://doi.org/10.48550/arXiv.2504.01029 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site
- Assessing employment and labour issues implicated by using AI
/ 2504.06322 / ISBN:https://doi.org/10.48550/arXiv.2504.06322 / Published by ArXiv / Version released on 2025-04-08 / on (web) Publishing site
- Towards interactive evaluations for interaction harms in human-AI systems / 2405.10632 / ISBN:https://doi.org/10.48550/arXiv.2405.10632 / Published by ArXiv / Version released on 2025-07-30 / on (web) Publishing site
- Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions / 2504.09343 / ISBN:https://doi.org/10.48550/arXiv.2504.09343 / Published by ArXiv / Version released on 2025-04-12 / on (web) Publishing site
- Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room / 2504.16148 / ISBN:https://doi.org/10.48550/arXiv.2504.16148 / Published by ArXiv / Version released on 2025-04-22 / on (web) Publishing site
- Auditing the Ethical Logic of Generative AI Models / 2504.17544 / ISBN:https://doi.org/10.48550/arXiv.2504.17544 / Published by ArXiv / Version released on 2025-04-24 / on (web) Publishing site
- AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to How / 2504.18044 / ISBN:https://doi.org/10.48550/arXiv.2504.18044 / Published by ArXiv / Version released on 2025-04-25 / on (web) Publishing site
- AI Awareness / 2504.20084 / ISBN:https://doi.org/10.48550/arXiv.2504.20084 / Published by ArXiv / Version released on 2025-06-29 / on (web) Publishing site
- Exploring AI-powered Digital Innovations from A Transnational Governance Perspective: Implications for Market Acceptance and Digital Accountability Accountability / 2504.20215 / ISBN:https://doi.org/10.48550/arXiv.2504.20215 / Published by ArXiv / Version released on 2025-04-28 / on (web) Publishing site
- Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation / 2504.21574 / ISBN:https://doi.org/10.48550/arXiv.2504.21574 / Published by ArXiv / Version released on 2025-04-30 / on (web) Publishing site
- From Texts to Shields: Convergence of Large Language Models and Cybersecurity / 2505.00841 / ISBN:https://doi.org/10.48550/arXiv.2505.00841 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs / 2505.02009 / ISBN:https://doi.org/10.48550/arXiv.2505.02009 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach / 2505.09576 / ISBN:https://doi.org/10.48550/arXiv.2505.09576 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site
- Formalising Human-in-the-Loop: Computational Reductions, Failure Modes, and Legal-Moral Responsibility / 2505.10426 / ISBN:https://doi.org/10.48550/arXiv.2505.10426 / Published by ArXiv / Version released on 2025-09-25 / on (web) Publishing site
- Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods / 2505.17870 / ISBN:https://doi.org/10.48550/arXiv.2505.17870 / Published by ArXiv / Version released on 2025-05-23 / on (web) Publishing site
- Toward a Cultural Co-Genesis of AI Ethics / 2505.21542 / ISBN:https://doi.org/10.48550/arXiv.2505.21542 / Published by ArXiv / Version released on 2025-05-24 / on (web) Publishing site
- Human-Centered Human-AI Collaboration (HCHAC) / 2505.22477 / ISBN:https://doi.org/10.48550/arXiv.2505.22477 / Published by ArXiv / Version released on 2025-05-28 / on (web) Publishing site
- SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents / 2505.23559 / ISBN:https://doi.org/10.48550/arXiv.2505.23559 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site
- DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models / 2506.01257 / ISBN:https://doi.org/10.48550/arXiv.2506.01257 / Published by ArXiv / Version released on 2025-06-02 / on (web) Publishing site
- Whole-Person Education for AI Engineers / 2506.09185 / ISBN:https://doi.org/10.48550/arXiv.2506.09185 / Published by ArXiv / Version released on 2025-06-10 / on (web) Publishing site
- Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe? / 2506.11945 / ISBN:https://doi.org/10.48550/arXiv.2506.11945 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site
- Feeling Machines: Ethics, Culture, and the Rise of Emotional AI / 2506.12437 / ISBN:https://doi.org/10.48550/arXiv.2506.12437 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site
- Foundation of Affective Computing and Interaction
/ 2506.15497 / ISBN:https://doi.org/10.48550/arXiv.2506.15497 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site
- AI Through the Human Lens: Investigating Cognitive Theories in Machine Psychology
/ 2506.18156 / ISBN:https://doi.org/10.48550/arXiv.2506.18156 / Published by ArXiv / Version released on 2025-11-07 / on (web) Publishing site
- Ask before you Build: Rethinking AI-for-Good in Human Trafficking Interventions / 2506.22512 / ISBN:https://doi.org/10.48550/arXiv.2506.22512 / Published by ArXiv / Version released on 2025-06-26 / on (web) Publishing site
- Moral Responsibility or Obedience: What Do We Want from AI? / 2507.02788 / ISBN:https://doi.org/10.48550/arXiv.2507.02788 / Published by ArXiv / Version released on 2025-07-03 / on (web) Publishing site
- AI Human Impact: Toward a Model for Ethical Investing in AI-Intensive Companies / 2507.07703 / ISBN:https://doi.org/10.48550/arXiv.2507.07703 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site
- Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust / 2506.07363 / ISBN:https://doi.org/10.48550/arXiv.2506.07363 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- Exploiting Jailbreaking Vulnerabilities in Generative AI to Bypass Ethical Safeguards for Facilitating Phishing Attacks / 2507.12185 / ISBN:https://doi.org/10.48550/arXiv.2507.12185 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- Challenges of Trustworthy Federated Learning: What's Done, Current Trends and Remaining Work / 2507.15796 / ISBN:https://doi.org/10.48550/arXiv.2507.15796 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site
- ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site
- PRAC3 (Privacy, Reputation, Accountability, Consent, Credit, Compensation): Long Tailed Risks of Voice Actors in AI Data-Economy / 2507.16247 / ISBN:https://doi.org/10.48550/arXiv.2507.16247 / Published by ArXiv / Version released on 2025-07-22 / on (web) Publishing site
- EthicAlly: a Prototype for AI-Powered Research Ethics Support for the Social Sciences and Humanities / 2508.00856 / ISBN:https://doi.org/10.48550/arXiv.2508.00856 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- DIRF: A Framework for Digital Identity Protection and Clone Governance in Agentic AI Systems / 2508.01997 / ISBN:https://doi.org/10.48550/arXiv.2508.01997 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site
- Data and AI governance: Promoting equity, ethics, and fairness in large language models / 2508.03970 / ISBN:https://doi.org/10.48550/arXiv.2508.03970 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site
- A Moral Agency Framework for Legitimate Integration of AI in Bureaucracies / 2508.08231 / ISBN:https://doi.org/10.48550/arXiv.2508.08231 / Published by ArXiv / Version released on 2025-08-21 / on (web) Publishing site
- The Architecture of AI Transformation: Four Strategic Patterns and an Emerging Frontier / 2509.02853 / ISBN:https://doi.org/10.48550/arXiv.2509.02853 / Published by ArXiv / Version released on 2025-09-10 / on (web) Publishing site
- CAI Fluency: A Framework for Cybersecurity AI Fluency / 2508.13588 / ISBN:https://doi.org/10.48550/arXiv.2508.13588 / Published by ArXiv / Version released on 2025-10-07 / on (web) Publishing site
- The AI-Fraud Diamond: A Novel Lens for Auditing Algorithmic Deception / 2508.13984 / ISBN:https://doi.org/10.48550/arXiv.2508.13984 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site
- Augmentation Technologies and AI - An Ethical Design Futures Framework / 2508.16615 / ISBN:https://doi.org/10.48550/arXiv.2508.16615 / Published by ArXiv / Version released on 2025-08-13 / on (web) Publishing site
- Towards Enhancing Data Equity in Public Health Data Science / 2508.20301 / ISBN:https://doi.org/10.48550/arXiv.2508.20301 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site
- Leveraging Imperfection with MEDLEY A Multi-Model Approach Harnessing Bias in Medical AI / 2508.21648 / ISBN:https://doi.org/10.48550/arXiv.2508.21648 / Published by ArXiv / Version released on 2025-08-29 / on (web) Publishing site
- A Study on the Framework for Evaluating the Ethics and Trustworthiness of Generative AI / 2509.00398 / ISBN:https://doi.org/10.48550/arXiv.2509.00398 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site
- Structured AI Decision-Making in Disaster Management / 2509.01576 / ISBN:https://doi.org/10.48550/arXiv.2509.01576 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site
- Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures / 2509.08839 / ISBN:https://doi.org/10.48550/arXiv.2509.08839 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site
- The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? / 2506.01813 / ISBN:https://doi.org/10.48550/arXiv.2506.01813 / Published by ArXiv / Version released on 2025-09-29 / on (web) Publishing site
- Trust and Transparency in AI: Industry Voices on Data, Ethics, and Compliance / 2509.22709 / ISBN:https://doi.org/10.48550/arXiv.2509.22709 / Published by ArXiv / Version released on 2025-09-23 / on (web) Publishing site
- AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site
- Trust in foundation models and GenAI: A geographic perspective / 2510.17942 / ISBN:https://doi.org/10.48550/arXiv.2510.17942 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site
- Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2026-01-23 / on (web) Publishing site
- Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- Knowing Ourselves Through Others: Reflecting with AI in Digital Human Debates / 2511.13046 / ISBN:https://doi.org/10.48550/arXiv.2511.13046 / Published by ArXiv / Version released on 2025-11-17 / on (web) Publishing site
- The Making of Digital Ghosts: Designing Ethical AI Afterlives / 2511.20094 / ISBN:https://doi.org/10.48550/arXiv.2511.20094 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site
- Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs / 2511.21757 / ISBN:https://doi.org/10.48550/arXiv.2511.21757 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site
- AGENTSAFE: A Unified Framework for Ethical Assurance and Governance in Agentic AI / 2512.03180 / ISBN:https://doi.org/10.48550/arXiv.2512.03180 / Published by ArXiv / Version released on 2025-12-02 / on (web) Publishing site
- The Decision Path to Control AI Risks Completely: Fundamental Control Mechanisms for AI Governance / 2512.04489 / ISBN:https://doi.org/10.48550/arXiv.2512.04489 / Published by ArXiv / Version released on 2025-12-24 / on (web) Publishing site
- Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research / 2512.10058 / ISBN:https://doi.org/10.48550/arXiv.2512.10058 / Published by ArXiv / Version released on 2025-12-10 / on (web) Publishing site
- A Framework for Responsible AI Systems: Building Societal Trust through Domain Definition, Trustworthy AI Design, Auditability, Accountability, and Governance / 2503.04739 / ISBN:https://doi.org/10.48550/arXiv.2503.04739 / Published by ArXiv / Version released on 2026-01-08 / on (web) Publishing site
- Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site
- The Psychology of Learning from Machines: Anthropomorphic AI and the Paradox of Automation in Education / 2601.06172 / ISBN:https://doi.org/10.48550/arXiv.2601.06172 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site
- Navigating Ethical AI Challenges in the Industrial Sector: Balancing Innovation and Responsibility / 2601.09351 / ISBN:https://doi.org/10.48550/arXiv.2601.09351 / Published by ArXiv / Version released on 2026-01-14 / on (web) Publishing site
- Human Society-Inspired Approaches to Agentic AI Security: The 4C Framework / 2602.01942 / ISBN:https://doi.org/10.48550/arXiv.2602.01942 / Published by ArXiv / Version released on 2026-02-02 / on (web) Publishing site
- Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site
- Insidious Imaginaries: A Critical Overview of AI Speculations / 2602.17383 / ISBN:https://doi.org/10.34626/2025_xcoax_001 / Version released on 2026-02-19 / on (web) Publishing site
- Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models / 2602.19124 / ISBN:https://doi.org/10.48550/arXiv.2602.19124 / Version released on 2026-02-22 / on (web) Publishing site
- Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Version released on 2026-03-23 / on (web) Publishing site
- The Landscape of Generative AI in Information Systems: A Synthesis of Secondary Reviews and Research Agendas / 2603.11842 / ISBN:https://doi.org/10.48550/arXiv.2603.11842 / Version released on 2026-03-12 / on (web) Publishing site
- Ghosting the Machine: Stop Calling Human-Agent Relations Parasocial / 2604.05197 / ISBN:https://doi.org/10.48550/arXiv.2604.05197 / Version released on 2026-04-06 / on (web) Publishing site
_