if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long
if you modify the keywords, press enter within the field to confirm the new search key
Tag: finetuning
Bibliography items where occurs: 312
- The AI Index 2022 Annual Report / 2205.03468 / ISBN:https://doi.org/10.48550/arXiv.2205.03468 / Published by ArXiv / Version released on 2022-05-02 / on (web) Publishing site
- On the Current and Emerging Challenges of Developing Fair and Ethical AI Solutions in Financial Services / 2111.01306 / ISBN:https://doi.org/10.48550/arXiv.2111.01306 / Published by ArXiv / Version released on 2021-11-02 / on (web) Publishing site
- A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation / 2305.11391 / ISBN:https://doi.org/10.48550/arXiv.2305.11391 / Published by ArXiv / Version released on 2023-08-27 / on (web) Publishing site
- Getting pwn'd by AI: Penetration Testing with Large Language Models / 2308.00121 / ISBN:https://doi.org/10.48550/arXiv.2308.00121 / Published by ArXiv / Version released on 2023-08-17 / on (web) Publishing site
- Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph / 2308.13534 / ISBN:https://doi.org/10.48550/arXiv.2308.13534 / Published by ArXiv / Version released on 2023-08-13 / on (web) Publishing site
- Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories? / 2308.15399 / ISBN:https://doi.org/10.48550/arXiv.2308.15399 / Published by ArXiv / Version released on 2024-07-01 / on (web) Publishing site
- Pathway to Future Symbiotic Creativity / 2209.02388 / ISBN:https://doi.org/10.48550/arXiv.2209.02388 / Published by ArXiv / Version released on 2023-09-13 / on (web) Publishing site
- The Cambridge Law Corpus: A Corpus for Legal AI Research / 2309.12269 / ISBN:https://doi.org/10.48550/arXiv.2309.12269 / Published by ArXiv / Version released on 2024-01-01 / on (web) Publishing site
- Security Considerations in AI-Robotics: A Survey of Current Methods, Challenges, and Opportunities / 2310.08565 / ISBN:https://doi.org/10.48550/arXiv.2310.08565 / Published by ArXiv / Version released on 2024-01-26 / on (web) Publishing site
- A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics / 2310.05694 / ISBN:https://doi.org/10.48550/arXiv.2310.05694 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site
- STREAM: Social data and knowledge collective intelligence platform for TRaining Ethical AI Models / 2310.05563 / ISBN:https://doi.org/10.48550/arXiv.2310.05563 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site
- AI & Blockchain as sustainable teaching and learning tools to cope with the 4IR / 2305.01088 / ISBN:https://doi.org/10.48550/arXiv.2305.01088 / Published by ArXiv / Version released on 2023-09-17 / on (web) Publishing site
- FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare / 2309.12325 / ISBN:https://doi.org/10.48550/arXiv.2309.12325 / Published by ArXiv / Version released on 2024-07-08 / on (web) Publishing site
- Specific versus General Principles for Constitutional AI / 2310.13798 / ISBN:https://doi.org/10.48550/arXiv.2310.13798 / Published by ArXiv / Version released on 2023-10-20 / on (web) Publishing site
- Unpacking the Ethical Value Alignment in Big Models / 2310.17551 / ISBN:https://doi.org/10.48550/arXiv.2310.17551 / Published by ArXiv / Version released on 2023-10-26 / on (web) Publishing site
- Kantian Deontology Meets AI Alignment: Towards Morally Grounded Fairness Metrics / 2311.05227 / ISBN:https://doi.org/10.48550/arXiv.2311.05227 / Published by ArXiv / Version released on 2024-02-26 / on (web) Publishing site
- Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing / 2304.02017 / ISBN:https://doi.org/10.48550/arXiv.2304.02017 / Published by ArXiv / Version released on 2024-08-03 / on (web) Publishing site
- A Brief History of Prompt: Leveraging Language Models. (Through Advanced Prompting) / 2310.04438 / ISBN:https://doi.org/10.48550/arXiv.2310.04438 / Published by ArXiv / Version released on 2023-11-28 / on (web) Publishing site
- Synergizing Human-AI Agency: A Guide of 23 Heuristics for Service Co-Creation with LLM-Based Agents / 2310.15065 / ISBN:https://doi.org/10.48550/arXiv.2310.15065 / Published by ArXiv / Version released on 2023-11-29 / on (web) Publishing site
- She had Cobalt Blue Eyes: Prompt Testing to Create Aligned and Sustainable Language Models / 2310.18333 / ISBN:https://doi.org/10.48550/arXiv.2310.18333 / Published by ArXiv / Version released on 2023-12-15 / on (web) Publishing site
- How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities / 2311.09447 / ISBN:https://doi.org/10.48550/arXiv.2311.09447 / Published by ArXiv / Version released on 2024-04-02 / on (web) Publishing site
- Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown / 2311.09731 / ISBN:https://doi.org/10.48550/arXiv.2311.09731 / Published by ArXiv / Version released on 2023-11-16 / on (web) Publishing site
- Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs / 2311.09976 / ISBN:https://doi.org/10.48550/arXiv.2311.09976 / Published by ArXiv / Version released on 2023-11-16 / on (web) Publishing site
- Case Repositories: Towards Case-Based Reasoning for AI Alignment / 2311.10934 / ISBN:https://doi.org/10.48550/arXiv.2311.10934 / Published by ArXiv / Version released on 2023-11-26 / on (web) Publishing site
- GPT in Data Science: A Practical Exploration of Model Selection / 2311.11516 / ISBN:https://doi.org/10.48550/arXiv.2311.11516 / Published by ArXiv / Version released on 2023-11-20 / on (web) Publishing site
- Large Language Models in Education: Vision and Opportunities / 2311.13160 / ISBN:https://doi.org/10.48550/arXiv.2311.13160 / Published by ArXiv / Version released on 2023-11-22 / on (web) Publishing site
- Towards Auditing Large Language Models: Improving Text-based Stereotype Detection / 2311.14126 / ISBN:https://doi.org/10.48550/arXiv.2311.14126 / Published by ArXiv / Version released on 2023-11-23 / on (web) Publishing site
- Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site
- From Lab to Field: Real-World Evaluation of an AI-Driven Smart Video Solution to Enhance Community Safety / 2312.02078 / ISBN:https://doi.org/10.48550/arXiv.2312.02078 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates / 2312.06861 / ISBN:https://doi.org/10.48550/arXiv.2312.06861 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site
- The AI Assessment Scale (AIAS): A Framework for Ethical Integration of Generative AI in Educational Assessment / 2312.07086 / ISBN:https://doi.org/10.48550/arXiv.2312.07086 / Published by ArXiv / Version released on 2024-04-24 / on (web) Publishing site
- Investigating Responsible AI for Scientific Research: An Empirical Study / 2312.09561 / ISBN:https://doi.org/10.48550/arXiv.2312.09561 / Published by ArXiv / Version released on 2023-12-15 / on (web) Publishing site
- Autonomous Threat Hunting: A Future Paradigm for AI-Driven Threat Intelligence / 2401.00286 / ISBN:https://doi.org/10.48550/arXiv.2401.00286 / Published by ArXiv / Version released on 2023-12-30 / on (web) Publishing site
- Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review / 2401.01519 / ISBN:https://doi.org/10.48550/arXiv.2401.01519 / Published by ArXiv / Version released on 2025-04-20 / on (web) Publishing site
- Synthetic Data in AI: Challenges, Applications, and Ethical Implications / 2401.01629 / ISBN:https://doi.org/10.48550/arXiv.2401.01629 / Published by ArXiv / Version released on 2024-01-03 / on (web) Publishing site
- MULTI-CASE: A Transformer-based Ethics-aware Multimodal Investigative Intelligence Framework / 2401.01955 / ISBN:https://doi.org/10.48550/arXiv.2401.01955 / Published by ArXiv / Version released on 2024-01-03 / on (web) Publishing site
- Business and ethical concerns in domestic Conversational Generative AI-empowered multi-robot systems / 2401.09473 / ISBN:https://doi.org/10.48550/arXiv.2401.09473 / Published by ArXiv / Version released on 2024-01-12 / on (web) Publishing site
- FAIR Enough How Can We Develop and Assess a FAIR-Compliant Dataset for Large Language Models' Training? / 2401.11033 / ISBN:https://doi.org/10.48550/arXiv.2401.11033 / Published by ArXiv / Version released on 2024-04-03 / on (web) Publishing site
- Enabling Global Image Data Sharing in the Life Sciences / 2401.13023 / ISBN:https://doi.org/10.48550/arXiv.2401.13023 / Published by ArXiv / Version released on 2024-02-02 / on (web) Publishing site
- Beyond principlism: Practical strategies for ethical AI use in research practices / 2401.15284 / ISBN:https://doi.org/10.48550/arXiv.2401.15284 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site
- Detecting Multimedia Generated by Large AI Models: A Survey / 2402.00045 / ISBN:https://doi.org/10.48550/arXiv.2402.00045 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site
- Commercial AI, Conflict, and Moral Responsibility: A theoretical analysis and practical approach to the moral responsibilities associated with dual-use AI technology / 2402.01762 / ISBN:https://doi.org/10.48550/arXiv.2402.01762 / Published by ArXiv / Version released on 2024-01-30 / on (web) Publishing site
- (A)I Am Not a Lawyer, But...: Engaging Legal Experts towards Responsible LLM Policies for Legal Advice / 2402.01864 / ISBN:https://doi.org/10.48550/arXiv.2402.01864 / Published by ArXiv / Version released on 2024-02-02 / on (web) Publishing site
- User Modeling and User Profiling: A Comprehensive Survey / 2402.09660 / ISBN:https://doi.org/10.48550/arXiv.2402.09660 / Published by ArXiv / Version released on 2024-02-20 / on (web) Publishing site
- Envisioning the Applications and Implications of Generative AI for News Media / 2402.18835 / ISBN:https://doi.org/10.48550/arXiv.2402.18835 / Published by ArXiv / Version released on 2024-02-29 / on (web) Publishing site
- The Minimum Information about CLinical Artificial Intelligence Checklist for Generative Modeling Research (MI-CLAIM-GEN) / 2403.02558 / ISBN:https://doi.org/10.48550/arXiv.2403.02558 / Published by ArXiv / Version released on 2024-07-12 / on (web) Publishing site
- A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site
- The Pursuit of Fairness in Artificial Intelligence Models A Survey / 2403.17333 / ISBN:https://doi.org/10.48550/arXiv.2403.17333 / Published by ArXiv / Version released on 2024-03-26 / on (web) Publishing site
- Exploring the Nexus of Large Language Models and Legal Systems: A Short Survey / 2404.00990 / ISBN:https://doi.org/10.48550/arXiv.2404.00990 / Published by ArXiv / Version released on 2024-04-01 / on (web) Publishing site
- A Review of Multi-Modal Large Language and Vision Models / 2404.01322 / ISBN:https://doi.org/10.48550/arXiv.2404.01322 / Published by ArXiv / Version released on 2024-03-28 / on (web) Publishing site
- Is Your AI Truly Yours? Leveraging Blockchain for Copyrights, Provenance, and Lineage
/ 2404.06077 / ISBN:https://doi.org/10.48550/arXiv.2404.06077 / Published by ArXiv / Version released on 2025-07-07 / on (web) Publishing site
- Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents / 2404.06750 / ISBN:https://arxiv.org/abs/2404.06750 / Published by ArXiv / Version released on 2024-10-18 / on (web) Publishing site
- AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site
- PoliTune: Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in Large Language Models / 2404.08699 / ISBN:https://doi.org/10.48550/arXiv.2404.08699 / Published by ArXiv / Version released on 2024-07-27 / on (web) Publishing site
- Detecting AI Generated Text Based on NLP and Machine Learning Approaches / 2404.10032 / ISBN:https://doi.org/10.48550/arXiv.2404.10032 / Published by ArXiv / Version released on 2024-04-15 / on (web) Publishing site
- Large Language Model Supply Chain: A Research Agenda / 2404.12736 / ISBN:https://doi.org/10.48550/arXiv.2404.12736 / Published by ArXiv / Version released on 2024-04-19 / on (web) Publishing site
- The Necessity of AI Audit Standards Boards / 2404.13060 / ISBN:https://doi.org/10.48550/arXiv.2404.13060 / Published by ArXiv / Version released on 2024-04-11 / on (web) Publishing site
- Modeling Emotions and Ethics with Large Language Models / 2404.13071 / ISBN:https://doi.org/10.48550/arXiv.2404.13071 / Published by ArXiv / Version released on 2024-06-25 / on (web) Publishing site
- Beyond Personhood: Agency, Accountability, and the Limits of Anthropomorphic Ethical Analysis / 2404.13861 / ISBN:https://doi.org/10.48550/arXiv.2404.13861 / Published by ArXiv / Version released on 2024-04-22 / on (web) Publishing site
- A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law / 2405.01769 / ISBN:https://doi.org/10.48550/arXiv.2405.01769 / Published by ArXiv / Version released on 2024-11-21 / on (web) Publishing site
- A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI / 2405.04333 / ISBN:https://doi.org/10.48550/arXiv.2405.04333 / Published by ArXiv / Version released on 2024-05-07 / on (web) Publishing site
- Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness / 2405.05930 / ISBN:https://doi.org/10.48550/arXiv.2405.05930 / Published by ArXiv / Version released on 2024-05-09 / on (web) Publishing site
- Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback / 2404.10271 / ISBN:https://doi.org/10.48550/arXiv.2404.10271 / Published by ArXiv / Version released on 2024-06-04 / on (web) Publishing site
- A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) / 2405.03066 / ISBN:https://doi.org/10.48550/arXiv.2405.03066 / Published by ArXiv / Version released on 2024-05-22 / on (web) Publishing site
- The Narrow Depth and Breadth of Corporate Responsible AI Research / 2405.12193 / ISBN:https://doi.org/10.48550/arXiv.2405.12193 / Published by ArXiv / Version released on 2026-01-28 / on (web) Publishing site
- A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions / 2405.14487 / ISBN:https://doi.org/10.48550/arXiv.2405.14487 / Published by ArXiv / Version released on 2024-05-23 / on (web) Publishing site
- Towards Clinical AI Fairness: Filling Gaps in the Puzzle / 2405.17921 / ISBN:https://doi.org/10.48550/arXiv.2405.17921 / Published by ArXiv / Version released on 2024-05-28 / on (web) Publishing site
- The AI Alignment Paradox / 2405.20806 / ISBN:https://doi.org/10.48550/arXiv.2405.20806 / Published by ArXiv / Version released on 2024-11-22 / on (web) Publishing site
- Gender Bias Detection in Court Decisions: A Brazilian Case Study / 2406.00393 / ISBN:https://doi.org/10.48550/arXiv.2406.00393 / Published by ArXiv / Version released on 2024-06-01 / on (web) Publishing site
- Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models / 2406.00628 / ISBN:https://doi.org/10.48550/arXiv.2406.00628 / Published by ArXiv / Version released on 2024-06-02 / on (web) Publishing site
- How Ethical Should AI Be? How AI Alignment Shapes the Risk Preferences of LLMs / 2406.01168 / ISBN:https://doi.org/10.48550/arXiv.2406.01168 / Published by ArXiv / Version released on 2024-08-01 / on (web) Publishing site
- The Impact of AI on Academic Research and Publishing / 2406.06009 / Published by ArXiv / Version released on 2024-06-10 / on (web) Publishing site
- An Empirical Design Justice Approach to Identifying Ethical Considerations in the Intersection of Large Language Models and Social Robotics / 2406.06400 / ISBN:https://doi.org/10.48550/arXiv.2406.06400 / Published by ArXiv / Version released on 2024-06-12 / on (web) Publishing site
- The Ethics of Interaction: Mitigating Security Threats in LLMs / 2401.12273 / ISBN:https://doi.org/10.48550/arXiv.2401.12273 / Published by ArXiv / Version released on 2024-07-10 / on (web) Publishing site
- Global AI Governance in Healthcare: A Cross-Jurisdictional Regulatory Analysis / 2406.08695 / ISBN:https://doi.org/10.48550/arXiv.2406.08695 / Published by ArXiv / Version released on 2024-06-12 / on (web) Publishing site
- Federated Learning driven Large Language Models for Swarm Intelligence: A Survey / 2406.09831 / ISBN:https://doi.org/10.48550/arXiv.2406.09831 / Published by ArXiv / Version released on 2024-06-14 / on (web) Publishing site
- Applications of Generative AI in Healthcare: algorithmic, ethical, legal and societal considerations / 2406.10632 / ISBN:https://doi.org/10.48550/arXiv.2406.10632 / Published by ArXiv / Version released on 2024-06-15 / on (web) Publishing site
- Current state of LLM Risks and AI Guardrails / 2406.12934 / ISBN:https://doi.org/10.48550/arXiv.2406.12934 / Published by ArXiv / Version released on 2024-06-16 / on (web) Publishing site
- Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health
/ 2406.13659 / ISBN:https://doi.org/10.48550/arXiv.2406.13659 / Published by ArXiv / Version released on 2024-06-19 / on (web) Publishing site
- Documenting Ethical Considerations in Open Source AI Models / 2406.18071 / ISBN:https://doi.org/10.48550/arXiv.2406.18071 / Published by ArXiv / Version released on 2024-07-03 / on (web) Publishing site
- AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations / 2406.18346 / ISBN:https://doi.org/10.48550/arXiv.2406.18346 / Published by ArXiv / Version released on 2024-06-26 / on (web) Publishing site
- Staying vigilant in the Age of AI: From content generation to content authentication / 2407.00922 / ISBN:https://doi.org/10.48550/arXiv.2407.00922 / Published by ArXiv / Version released on 2024-07-01 / on (web) Publishing site
- A Blueprint for Auditing Generative AI / 2407.05338 / ISBN:https://doi.org/10.48550/arXiv.2407.05338 / Published by ArXiv / Version released on 2024-07-07 / on (web) Publishing site
- Bridging the Global Divide in AI Regulation: A Proposal for a Contextual, Coherent, and Commensurable Framework / 2303.11196 / ISBN:https://doi.org/10.48550/arXiv.2303.11196 / Published by ArXiv / Version released on 2024-07-15 / on (web) Publishing site
- Generative AI for Health Technology Assessment: Opportunities, Challenges, and Policy Considerations / 2407.11054 / ISBN:https://doi.org/10.48550/arXiv.2407.11054 / Published by ArXiv / Version released on 2024-09-21 / on (web) Publishing site
- Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias / 2407.11360 / ISBN:https://doi.org/10.48550/arXiv.2407.11360 / Published by ArXiv / Version released on 2024-07.16 / on (web) Publishing site
- Prioritizing High-Consequence Biological Capabilities in Evaluations of Artificial Intelligence Models / 2407.13059 / ISBN:https://doi.org/10.48550/arXiv.2407.13059 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site
- Assurance of AI Systems From a Dependability Perspective / 2407.13948 / ISBN:https://doi.org/10.48550/arXiv.2407.13948 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site
- Open Artificial Knowledge / 2407.14371 / ISBN:https://doi.org/10.48550/arXiv.2407.14371 / Published by ArXiv / Version released on 2024-07-19 / on (web) Publishing site
- RogueGPT: dis-ethical tuning transforms ChatGPT4 into a Rogue AI in 158 Words / 2407.15009 / ISBN:https://doi.org/10.48550/arXiv.2407.15009 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site
- Deepfake Media Forensics: State of the Art and Challenges Ahead / 2408.00388 / ISBN:https://doi.org/10.48550/arXiv.2408.00388 / Published by ArXiv / Version released on 2024-08-13 / on (web) Publishing site
- Improving Large Language Model (LLM) fidelity through context-aware grounding: A systematic approach to reliability and veracity / 2408.04023 / ISBN:https://doi.org/10.48550/arXiv.2408.04023 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site
- The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site
- Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives / 2407.14962 / ISBN:https://doi.org/10.48550/arXiv.2407.14962 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site
- Don't Kill the Baby: The Case for AI in Arbitration / 2408.11608 / ISBN:https://doi.org/10.48550/arXiv.2408.11608 / Published by ArXiv / Version released on 2025-03-22 / on (web) Publishing site
- CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher / 2408.11650 / ISBN:https://doi.org/10.48550/arXiv.2408.11650 / Published by ArXiv / Version released on 2024-11-06 / on (web) Publishing site
- The Problems with Proxies: Making Data Work Visible through Requester Practices / 2408.11667 / ISBN:https://doi.org/10.48550/arXiv.2408.11667 / Published by ArXiv / Version released on 2024-08-21 / on (web) Publishing site
- Catalog of General Ethical Requirements for AI Certification / 2408.12289 / ISBN:https://doi.org/10.48550/arXiv.2408.12289 / Published by ArXiv / Version released on 2024-11-15 / on (web) Publishing site
- Is Generative AI the Next Tactical Cyber Weapon For Threat Actors? Unforeseen Implications of AI Generated Cyber Attacks / 2408.12806 / ISBN:https://doi.org/10.48550/arXiv.2408.12806 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site
- Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey / 2408.12880 / ISBN:https://doi.org/10.48550/arXiv.2408.12880 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site
- A Survey for Large Language Models in Biomedicine / 2409.00133 / ISBN:https://doi.org/10.48550/arXiv.2409.00133 / Published by ArXiv / Version released on 2024-08-29 / on (web) Publishing site
- Digital Homunculi: Reimagining Democracy Research with Generative Agents / 2409.00826 / ISBN:https://doi.org/10.48550/arXiv.2409.00826 / Published by ArXiv / Version released on 2024-09-01 / on (web) Publishing site
- DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection / 2409.06072 / ISBN:https://doi.org/10.48550/arXiv.2409.06072 / Published by ArXiv / Version released on 2024-09-09 / on (web) Publishing site
- On the Creativity of Large Language Models / 2304.00008 / ISBN:https://doi.org/10.48550/arXiv.2304.00008 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site
- LLM generated responses to mitigate the impact of hate speech / 2311.16905 / ISBN:https://doi.org/10.48550/arXiv.2311.16905 / Published by ArXiv / Version released on 2024-10-02 / on (web) Publishing site
- Data-Centric Foundation Models in Computational Healthcare: A Survey / 2401.02458 / ISBN:https://doi.org/10.48550/arXiv.2401.02458 / Published by ArXiv / Version released on 2026-04-29 / on (web) Publishing site
- Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models / 2401.16727 / ISBN:https://doi.org/10.48550/arXiv.2401.16727 / Published by ArXiv / Version released on 2024-10-30 / on (web) Publishing site
- Large language models as linguistic simulators and cognitive models in human research / 2402.04470 / ISBN:https://doi.org/10.48550/arXiv.2402.04470 / Published by ArXiv / Version released on 2024-10-20 / on (web) Publishing site
- Navigating LLM Ethics: Advancements, Challenges, and Future Directions / 2406.18841 / ISBN:https://doi.org/10.48550/arXiv.2406.18841 / Published by ArXiv / Version released on 2025-06-15 / on (web) Publishing site
- Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- XTRUST: On the Multilingual Trustworthiness of Large Language Models / 2409.15762 / ISBN:https://doi.org/10.48550/arXiv.2409.15762 / Published by ArXiv / Version released on 2024-09-24 / on (web) Publishing site
- Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
/ 2409.16001 / ISBN:https://doi.org/10.48550/arXiv.2409.16001 / Published by ArXiv / Version released on 2025-12-05 / on (web) Publishing site
- Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications / 2409.16872 / ISBN:https://doi.org/10.48550/arXiv.2409.16872 / Published by ArXiv / Version released on 2024-12-05 / on (web) Publishing site
- Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions / 2409.16974 / ISBN:https://doi.org/10.48550/arXiv.2409.16974 / Published by ArXiv / Version released on 2024-09-25 / on (web) Publishing site
- Safety challenges of AI in medicine / 2409.18968 / ISBN:https://doi.org/10.48550/arXiv.2409.18968 / Published by ArXiv / Version released on 2024-09-11 / on (web) Publishing site
- Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure / 2409.19104 / ISBN:https://doi.org/10.48550/arXiv.2409.19104 / Published by ArXiv / Version released on 2024-09-27 / on (web) Publishing site
- Clinnova Federated Learning Proof of Concept: Key Takeaways from a Cross-border Collaboration / 2410.02443 / ISBN:https://doi.org/10.48550/arXiv.2410.02443 / Published by ArXiv / Version released on 2024-10-03 / on (web) Publishing site
- AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models / 2410.07561 / ISBN:https://doi.org/10.48550/arXiv.2410.07561 / Published by ArXiv / Version released on 2024-12-12 / on (web) Publishing site
- Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models
/ 2410.12880 / ISBN:https://doi.org/10.48550/arXiv.2410.12880 / Published by ArXiv / Version released on 2025-01-24 / on (web) Publishing site
- Data Defenses Against Large Language Models / 2410.13138 / ISBN:https://doi.org/10.48550/arXiv.2410.13138 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site
- Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site
- Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site
- Demystifying Large Language Models for Medicine: A Primer / 2410.18856 / ISBN:https://doi.org/10.48550/arXiv.2410.18856 / Published by ArXiv / Version released on 2024-11-20 / on (web) Publishing site
- The Dark Side of AI Companionship: A Taxonomy of Harmful Algorithmic Behaviors in Human-AI Relationships / 2410.20130 / ISBN:https://doi.org/10.48550/arXiv.2410.20130 / Published by ArXiv / Version released on 2025-01-26 / on (web) Publishing site
- The Trap of Presumed Equivalence: Artificial General Intelligence Should Not Be Assessed on the Scale of Human Intelligence / 2410.21296 / ISBN:https://doi.org/10.48550/arXiv.2410.21296 / Published by ArXiv / Version released on 2024-11-11 / on (web) Publishing site
- Using Large Language Models for a standard assessment mapping for sustainable communities / 2411.00208 / ISBN:https://doi.org/10.48550/arXiv.2411.00208 / Published by ArXiv / Version released on 2024-11-25 / on (web) Publishing site
- I Always Felt that Something Was Wrong.: Understanding Compliance Risks and Mitigation Strategies when Professionals Use Large Language Models / 2411.04576 / ISBN:https://doi.org/10.48550/arXiv.2411.04576 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site
- A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions / 2406.03712 / ISBN:https://doi.org/10.48550/arXiv.2406.03712 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications / 2411.06837 / ISBN:https://doi.org/10.48550/arXiv.2411.06837 / Published by ArXiv / Version released on 2026-04-21 / on (web) Publishing site
- Enhancing Accessibility in Special Libraries: A Study on AI-Powered Assistive Technologies for Patrons with Disabilities / 2411.06970 / ISBN:https://doi.org/10.48550/arXiv.2411.06970 / Published by ArXiv / Version released on 2024-11-11 / on (web) Publishing site
- Collaborative Participatory Research with LLM Agents in South Asia: An Empirically-Grounded Methodological Initiative and Agenda from Field Evidence in Sri Lanka / 2411.08294 / ISBN:https://doi.org/10.48550/arXiv.2411.08294 / Published by ArXiv / Version released on 2024-11-13 / on (web) Publishing site
- Bias in Large Language Models: Origin, Evaluation, and Mitigation / 2411.10915 / ISBN:https://doi.org/10.48550/arXiv.2411.10915 / Published by ArXiv / Version released on 2026-05-01 / on (web) Publishing site
- Framework for developing and evaluating ethical collaboration between expert and machine / 2411.10983 / ISBN:https://doi.org/10.48550/arXiv.2411.10983 / Published by ArXiv / Version released on 2024-11-17 / on (web) Publishing site
- AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments / 2411.17539 / ISBN:https://doi.org/10.48550/arXiv.2411.17539 / Published by ArXiv / Version released on 2024-11-26 / on (web) Publishing site
- Large Language Models in Politics and Democracy: A Comprehensive Survey / 2412.04498 / ISBN:https://doi.org/10.48550/arXiv.2412.04498 / Published by ArXiv / Version released on 2024-12-16 / on (web) Publishing site
- Political-LLM: Large Language Models in Political Science / 2412.06864 / ISBN:https://doi.org/10.48550/arXiv.2412.06864 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site
- CERN for AI: A Theoretical Framework for Autonomous Simulation-Based Artificial Intelligence Testing and Alignment / 2312.09402 / ISBN:https://doi.org/10.48550/arXiv.2312.09402 / Published by ArXiv / Version released on 2025-01-06 / on (web) Publishing site
- Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site
- INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models / 2501.01973 / ISBN:https://doi.org/10.48550/arXiv.2501.01973 / Published by ArXiv / Version released on 2025-01-09 / on (web) Publishing site
- Curious, Critical Thinker, Empathetic, and Ethically Responsible: Essential Soft Skills for Data Scientists in Software Engineering / 2501.02088 / ISBN:https://doi.org/10.48550/arXiv.2501.02088 / Published by ArXiv / Version released on 2025-01-29 / on (web) Publishing site
- Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto / 2312.01818 / ISBN:https://doi.org/10.48550/arXiv.2312.01818 / Published by ArXiv / Version released on 2025-01-16 / on (web) Publishing site
- Towards A Litmus Test for Common Sense / 2501.09913 / ISBN:https://doi.org/10.48550/arXiv.2501.09913 / Published by ArXiv / Version released on 2025-01-17 / on (web) Publishing site
- Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude / 2501.10484 / ISBN:https://doi.org/10.48550/arXiv.2501.10484 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site
- Harnessing the Potential of Large Language Models in Modern Marketing Management: Applications, Future Directions, and Strategic Recommendations / 2501.10685 / ISBN:https://doi.org/10.48550/arXiv.2501.10685 / Published by ArXiv / Version released on 2025-01-18 / on (web) Publishing site
- Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications
/ 2501.12456 / ISBN:https://doi.org/10.48550/arXiv.2501.12456 / Published by ArXiv / Version released on 2025-01-21 / on (web) Publishing site
- A Critical Field Guide for Working with Machine Learning Datasets / 2501.15491 / ISBN:https://doi.org/10.48550/arXiv.2501.15491 / Published by ArXiv / Version released on 2025-01-26 / on (web) Publishing site
- The Third Moment of AI Ethics: Developing Relatable and Contextualized Tools / 2501.16954 / ISBN:https://doi.org/10.48550/arXiv.2501.16954 / Published by ArXiv / Version released on 2025-01-28 / on (web) Publishing site
- Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline / 2501.18493 / ISBN:https://doi.org/10.48550/arXiv.2501.18493 / Published by ArXiv / Version released on 2025-01-30 / on (web) Publishing site
- Towards Safe AI Clinicians: A Comprehensive Study on Large Language Model Jailbreaking in Healthcare / 2501.18632 / ISBN:https://doi.org/10.48550/arXiv.2501.18632 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site
- Agentic AI: Expanding the Algorithmic Frontier of Creative Problem Solving / 2502.00289 / ISBN:https://doi.org/10.48550/arXiv.2502.00289 / Published by ArXiv / Version released on 2025-02-01 / on (web) Publishing site
- Constructing AI ethics narratives based on real-world data: Human-AI collaboration in data-driven visual storytelling / 2502.00637 / ISBN:https://doi.org/10.48550/arXiv.2502.00637 / Published by ArXiv / Version released on 2025-02-02 / on (web) Publishing site
- FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing / 2502.03826 / ISBN:https://doi.org/10.48550/arXiv.2502.03826 / Published by ArXiv / Version released on 2025-08-15 / on (web) Publishing site
- Open Foundation Models in Healthcare: Challenges, Paradoxes, and Opportunities with GenAI Driven Personalized Prescription / 2502.04356 / ISBN:https://doi.org/10.48550/arXiv.2502.04356 / Published by ArXiv / Version released on 2025-02-04 / on (web) Publishing site
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site
- The Odyssey of the Fittest: Can Agents Survive and Still Be Good? / 2502.05442 / ISBN:https://doi.org/10.48550/arXiv.2502.05442 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site
- Agentic AI for Scaling Diagnosis and Care in Neurodegenerative Disease / 2502.06842 / ISBN:https://doi.org/10.48550/arXiv.2502.06842 / Published by ArXiv / Version released on 2025-12-23 / on (web) Publishing site
- From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
/ 2502.09242 / ISBN:https://doi.org/10.48550/arXiv.2502.09242 / Published by ArXiv / Version released on 2025-02-13 / on (web) Publishing site
- Relational Norms for Human-AI Cooperation / 2502.12102 / ISBN:https://doi.org/10.48550/arXiv.2502.12102 / Published by ArXiv / Version released on 2025-02-17 / on (web) Publishing site
- Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site
- On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review / 2502.14886 / ISBN:https://doi.org/10.48550/arXiv.2502.14886 / Published by ArXiv / Version released on 2025-11-03 / on (web) Publishing site
- Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives
/ 2502.16841 / ISBN:https://doi.org/10.48550/arXiv.2502.16841 / Published by ArXiv / Version released on 2026-01-14 / on (web) Publishing site
- Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models / 2502.18505 / ISBN:https://doi.org/10.48550/arXiv.2502.18505 / Published by ArXiv / Version released on 2025-02-21 / on (web) Publishing site
- Developmental Support Approach to AI's Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning / 2502.19798 / ISBN:https://doi.org/10.48550/arXiv.2502.19798 / Published by ArXiv / Version released on 2025-02-27 / on (web) Publishing site
- An LLM-based Delphi Study to Predict GenAI Evolution / 2502.21092 / ISBN:https://doi.org/10.48550/arXiv.2502.21092 / Published by ArXiv / Version released on 2025-02-28 / on (web) Publishing site
- Evaluating Large Language Models on the Spanish Medical Intern Resident (MIR) Examination 2024/2025:A Comparative Analysis of Clinical Reasoning and Knowledge Application / 2503.00025 / ISBN:https://doi.org/10.48550/arXiv.2503.00025 / Published by ArXiv / Version released on 2025-03-16 / on (web) Publishing site
- Digital Dybbuks and Virtual Golems: AI, Memory, and the Ethics of Holocaust Testimony / 2503.01369 / ISBN:https://doi.org/10.48550/arXiv.2503.01369 / Published by ArXiv / Version released on 2025-03-03 / on (web) Publishing site
- Vision Language Models in Medicine / 2503.01863 / ISBN:https://doi.org/10.48550/arXiv.2503.01863 / Published by ArXiv / Version released on 2025-02-24 / on (web) Publishing site
- Medical Hallucinations in Foundation Models and Their Impact on Healthcare / 2503.05777 / ISBN:https://doi.org/10.48550/arXiv.2503.05777 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site
- Generative AI in Transportation Planning: A Survey / 2503.07158 / ISBN:https://doi.org/10.48550/arXiv.2503.07158 / Published by ArXiv / Version released on 2025-05-07 / on (web) Publishing site
- MinorBench: A hand-built benchmark for content-based risks for children / 2503.10242 / ISBN:https://doi.org/10.48550/arXiv.2503.10242 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site
- LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions / 2503.10486 / ISBN:https://doi.org/10.48550/arXiv.2503.10486 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site
- DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site
- Policy Frameworks for Transparent Chain-of-Thought Reasoning in Large Language Models / 2503.14521 / ISBN:https://doi.org/10.48550/arXiv.2503.14521 / Published by ArXiv / Version released on 2025-03-14 / on (web) Publishing site
- BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models
/ 2503.24310 / ISBN:https://doi.org/10.48550/arXiv.2503.24310 / Published by ArXiv / Version released on 2025-03-31 / on (web) Publishing site
- Bridging the Gap: Integrating Ethics and Environmental Sustainability in AI Research and Practice / 2504.00797 / ISBN:https://doi.org/10.48550/arXiv.2504.00797 / Published by ArXiv / Version released on 2025-04-01 / on (web) Publishing site
- Who Owns the Output? Bridging Law and Technology in LLMs Attribution / 2504.01032 / ISBN:https://doi.org/10.48550/arXiv.2504.01032 / Published by ArXiv / Version released on 2025-03-29 / on (web) Publishing site
- Language-Dependent Political Bias in AI: A Study of ChatGPT and Gemini / 2504.06436 / ISBN:https://doi.org/10.48550/arXiv.2504.06436 / Published by ArXiv / Version released on 2025-04-08 / on (web) Publishing site
- We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy / 2504.07936 / ISBN:https://doi.org/10.48550/arXiv.2504.07936 / Published by ArXiv / Version released on 2025-04-10 / on (web) Publishing site
- A Comprehensive Survey on Integrating Large Language Models with Knowledge-Based Methods / 2501.13947 / ISBN:https://doi.org/10.48550/arXiv.2501.13947 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation / 2502.05151 / ISBN:https://doi.org/10.48550/arXiv.2502.05151 / Published by ArXiv / Version released on 2026-03-05 / on (web) Publishing site
- Who is Responsible? The Data, Models, Users or Regulations? A Comprehensive Survey on Responsible Generative AI for a Sustainable Future / 2502.08650 / ISBN:https://doi.org/10.48550/arXiv.2502.08650 / Published by ArXiv / Version released on 2025-04-28 / on (web) Publishing site
- Evaluation Framework for AI Systems in the Wild / 2504.16778 / ISBN:https://doi.org/10.48550/arXiv.2504.16778 / Published by ArXiv / Version released on 2025-04-28 / on (web) Publishing site
- Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions / 2504.09343 / ISBN:https://doi.org/10.48550/arXiv.2504.09343 / Published by ArXiv / Version released on 2025-04-12 / on (web) Publishing site
- Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room / 2504.16148 / ISBN:https://doi.org/10.48550/arXiv.2504.16148 / Published by ArXiv / Version released on 2025-04-22 / on (web) Publishing site
- Approaches to Responsible Governance of GenAI in Organizations / 2504.17044 / ISBN:https://doi.org/10.48550/arXiv.2504.17044 / Published by ArXiv / Version released on 2025-09-14 / on (web) Publishing site
- Auditing the Ethical Logic of Generative AI Models / 2504.17544 / ISBN:https://doi.org/10.48550/arXiv.2504.17544 / Published by ArXiv / Version released on 2025-04-24 / on (web) Publishing site
- TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models / 2504.20605 / ISBN:https://doi.org/10.48550/arXiv.2504.20605 / Published by ArXiv / Version released on 2026-05-02 / on (web) Publishing site
- Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation / 2504.21574 / ISBN:https://doi.org/10.48550/arXiv.2504.21574 / Published by ArXiv / Version released on 2025-04-30 / on (web) Publishing site
- From Texts to Shields: Convergence of Large Language Models and Cybersecurity / 2505.00841 / ISBN:https://doi.org/10.48550/arXiv.2505.00841 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- LLM Ethics Benchmark: A Three-Dimensional Assessment System for Evaluating Moral Reasoning in Large Language Models / 2505.00853 / ISBN:https://doi.org/10.48550/arXiv.2505.00853 / Published by ArXiv / Version released on 2025-05-01 / on (web) Publishing site
- Emotions in the Loop: A Survey of Affective Computing for Emotional Support / 2505.01542 / ISBN:https://doi.org/10.48550/arXiv.2505.01542 / Published by ArXiv / Version released on 2025-05-02 / on (web) Publishing site
- Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs / 2505.02009 / ISBN:https://doi.org/10.48550/arXiv.2505.02009 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site
- AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques / 2505.08202 / ISBN:https://doi.org/10.48550/arXiv.2505.08202 / Published by ArXiv / Version released on 2025-05-13 / on (web) Publishing site
- WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models / 2505.09595 / ISBN:https://doi.org/10.48550/arXiv.2505.09595 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site
- Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data / 2505.09974 / ISBN:https://doi.org/10.48550/arXiv.2505.09974 / Published by ArXiv / Version released on 2025-05-15 / on (web) Publishing site
- AI LEGO: Scaffolding Cross-Functional Collaboration in Industrial Responsible AI Practices during Early Design Stages / 2505.10300 / ISBN:https://doi.org/10.48550/arXiv.2505.10300 / Published by ArXiv / Version released on 2025-05-15 / on (web) Publishing site
- Let's have a chat with the EU AI Act / 2505.11946 / ISBN:https://doi.org/10.48550/arXiv.2505.11946 / Published by ArXiv / Version released on 2025-05-17 / on (web) Publishing site
- From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery / 2505.13259 / ISBN:https://doi.org/10.48550/arXiv.2505.13259 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site
- AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals / 2505.15365 / ISBN:https://doi.org/10.48550/arXiv.2505.15365 / Published by ArXiv / Version released on 2025-05-21 / on (web) Publishing site
- Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery / 2505.16477 / ISBN:https://doi.org/10.48550/arXiv.2505.16477 / Published by ArXiv / Version released on 2025-05-22 / on (web) Publishing site
- A Toolkit for Compliance, a Toolkit for Justice: Drawing on Cross-sectoral Expertise to Develop a Pro-justice EU AI Act Toolkit / 2505.17165 / ISBN:https://doi.org/10.48550/arXiv.2505.17165 / Published by ArXiv / Version released on 2025-05-22 / on (web) Publishing site
- SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use / 2505.17332 / ISBN:https://doi.org/10.48550/arXiv.2505.17332 / Published by ArXiv / Version released on 2025-05-22 / on (web) Publishing site
- Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods / 2505.17870 / ISBN:https://doi.org/10.48550/arXiv.2505.17870 / Published by ArXiv / Version released on 2025-05-23 / on (web) Publishing site
- Making Sense of the Unsensible: Reflection, Survey, and Challenges for XAI in Large Language Models Toward Human-Centered AI / 2505.20305 / ISBN:https://doi.org/10.48550/arXiv.2505.20305 / Published by ArXiv / Version released on 2025-05-18 / on (web) Publishing site
- Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions / 2505.20692 / ISBN:https://doi.org/10.48550/arXiv.2505.20692 / Published by ArXiv / Version released on 2025-05-27 / on (web) Publishing site
- Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site
- SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents / 2505.23559 / ISBN:https://doi.org/10.48550/arXiv.2505.23559 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site
- Locating Risk: Task Designers and the Challenge of Risk Disclosure in RAI Content Work / 2505.24246 / ISBN:https://doi.org/10.48550/arXiv.2505.24246 / Published by ArXiv / Version released on 2026-03-31 / on (web) Publishing site
- Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
/ 2506.00415 / ISBN:https://doi.org/10.48550/arXiv.2506.00415 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site
- DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models / 2506.01257 / ISBN:https://doi.org/10.48550/arXiv.2506.01257 / Published by ArXiv / Version released on 2025-06-02 / on (web) Publishing site
- Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment / 2506.02046 / ISBN:https://doi.org/10.48550/arXiv.2506.02046 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site
- Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation / 2506.02992 / ISBN:https://doi.org/10.48550/arXiv.2506.02992 / Published by ArXiv / Version released on 2025-06-03 / on (web) Publishing site
- Feeling Machines: Ethics, Culture, and the Rise of Emotional AI / 2506.12437 / ISBN:https://doi.org/10.48550/arXiv.2506.12437 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site
- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site
- Foundation of Affective Computing and Interaction
/ 2506.15497 / ISBN:https://doi.org/10.48550/arXiv.2506.15497 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site
- JETHICS: Japanese Ethics Understanding Evaluation Dataset
/ 2506.16187 / ISBN:https://doi.org/10.48550/arXiv.2506.16187 / Published by ArXiv / Version released on 2025-06-19 / on (web) Publishing site
- SafeTriage: Facial Video De-identification for Privacy-Preserving Stroke Triage / 2506.16578 / ISBN:https://doi.org/10.48550/arXiv.2506.16578 / Published by ArXiv / Version released on 2025-06-19 / on (web) Publishing site
- AI based Content Creation and Product Recommendation Applications in E-commerce: An Ethical overview / 2506.17370 / ISBN:https://doi.org/10.48550/arXiv.2506.17370 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site
- AI Through the Human Lens: Investigating Cognitive Theories in Machine Psychology
/ 2506.18156 / ISBN:https://doi.org/10.48550/arXiv.2506.18156 / Published by ArXiv / Version released on 2025-11-07 / on (web) Publishing site
- Can AI be Consentful? / 2507.01051 / ISBN:https://doi.org/10.48550/arXiv.2507.01051 / Published by ArXiv / Version released on 2025-06-27 / on (web) Publishing site
- A Practical SAFE-AI Framework for Small and Medium-Sized Enterprises Developing Medical Artificial Intelligence Ethics Policies / 2507.01304 / ISBN:https://doi.org/10.48550/arXiv.2507.01304 / Published by ArXiv / Version released on 2025-07-02 / on (web) Publishing site
- Model Cards Revisited: Bridging the Gap Between Theory and Practice for Ethical AI Requirements / 2507.06014 / ISBN:https://doi.org/10.48550/arXiv.2507.06014 / Published by ArXiv / Version released on 2025-07-08 / on (web) Publishing site
- When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site
- Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust / 2506.07363 / ISBN:https://doi.org/10.48550/arXiv.2506.07363 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics / 2506.12365 / ISBN:https://doi.org/10.48550/arXiv.2506.12365 / Published by ArXiv / Version released on 2025-07-31 / on (web) Publishing site
- Policy-Driven AI in Dataspaces: Taxonomy, Explainability, and Pathways for Compliant Innovation / 2507.20014 / ISBN:https://doi.org/10.48550/arXiv.2507.20014 / Published by ArXiv / Version released on 2025-07-30 / on (web) Publishing site
- The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site
- Redefining Elderly Care with Agentic AI: Challenges and Opportunities / 2507.14912 / ISBN:https://doi.org/10.48550/arXiv.2507.14912 / Published by ArXiv / Version released on 2025-07-20 / on (web) Publishing site
- Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation / 2507.15901 / ISBN:https://doi.org/10.48550/arXiv.2507.15901 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site
- Beyond Algorethics: Addressing the Ethical and Anthropological Challenges of AI Recommender Systems / 2507.16430 / ISBN:https://doi.org/10.48550/arXiv.2507.16430 / Published by ArXiv / Version released on 2025-07-22 / on (web) Publishing site
- Defining ethically sourced code generation / 2507.19743 / ISBN:https://doi.org/10.48550/arXiv.2507.19743 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site
- EthicAlly: a Prototype for AI-Powered Research Ethics Support for the Social Sciences and Humanities / 2508.00856 / ISBN:https://doi.org/10.48550/arXiv.2508.00856 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site
- DIRF: A Framework for Digital Identity Protection and Clone Governance in Agentic AI Systems / 2508.01997 / ISBN:https://doi.org/10.48550/arXiv.2508.01997 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site
- The Silicon Reasonable Person: Can AI Predict How Ordinary People Judge Reasonableness? / 2508.02766 / ISBN:https://doi.org/10.48550/arXiv.2508.02766 / Published by ArXiv / Version released on 2025-08-04 / on (web) Publishing site
- Data and AI governance: Promoting equity, ethics, and fairness in large language models / 2508.03970 / ISBN:https://doi.org/10.48550/arXiv.2508.03970 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site
- PrinciplismQA: A Philosophy-Grounded Approach to Assessing LLM-Human Clinical Medical Ethics Alignment / 2508.05132 / ISBN:https://doi.org/10.48550/arXiv.2508.05132 / Published by ArXiv / Version released on 2026-04-20 / on (web) Publishing site
- A Methodological Framework and Questionnaire for Investigating Perceived Algorithmic Fairness / 2508.05281 / ISBN:https://doi.org/10.48550/arXiv.2508.05281 / Published by ArXiv / Version released on 2025-08-07 / on (web) Publishing site
- Do Ethical AI Principles Matter to Users? A Large-Scale Analysis of User Sentiment and Satisfaction / 2508.05913 / ISBN:https://doi.org/10.48550/arXiv.2508.05913 / Published by ArXiv / Version released on 2025-08-08 / on (web) Publishing site
- The Fair Game: Auditing & Debiasing AI Algorithms Over Time / 2508.06443 / ISBN:https://doi.org/10.48550/arXiv.2508.06443 / Published by ArXiv / Version released on 2025-08-08 / on (web) Publishing site
- A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site
- Ethical Concerns of Generative AI and Mitigation Strategies: A Systematic Mapping Study / 2502.00015 / ISBN:https://doi.org/10.48550/arXiv.2502.00015 / Published by ArXiv / Version released on 2025-08-22 / on (web) Publishing site
- Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- A Comprehensive Review of Datasets for Clinical Mental Health AI Systems / 2508.09809 / ISBN:https://doi.org/10.48550/arXiv.2508.09809 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site
- A Systematic Survey of Model Extraction Attacks and Defenses: State-of-the-Art and Perspectives / 2508.15031 / ISBN:https://doi.org/10.48550/arXiv.2508.15031 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site
- A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond / 2508.11957 / ISBN:https://doi.org/10.48550/arXiv.2508.11957 / Published by ArXiv / Version released on 2025-08-16 / on (web) Publishing site
- The AI-Fraud Diamond: A Novel Lens for Auditing Algorithmic Deception / 2508.13984 / ISBN:https://doi.org/10.48550/arXiv.2508.13984 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site
- The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site
- Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science / 2508.20674 / ISBN:https://doi.org/10.48550/arXiv.2508.20674 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site
- Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI / 2508.21101 / ISBN:https://doi.org/10.48550/arXiv.2508.21101 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site
- Leveraging Imperfection with MEDLEY A Multi-Model Approach Harnessing Bias in Medical AI / 2508.21648 / ISBN:https://doi.org/10.48550/arXiv.2508.21648 / Published by ArXiv / Version released on 2025-08-29 / on (web) Publishing site
- Designing LMS and Instructional Strategies for Integrating Generative-Conversational AI / 2509.00709 / ISBN:https://doi.org/10.48550/arXiv.2509.00709 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site
- The Ethical Compass of the Machine: Evaluating Large Language Models for Decision Support in Construction Project Management / 2509.04505 / ISBN:https://doi.org/10.48550/arXiv.2509.04505 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site
- Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site
- ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code / 2509.07006 / ISBN:https://doi.org/10.48550/arXiv.2509.07006 / Published by ArXiv / Version released on 2025-09-06 / on (web) Publishing site
- Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures / 2509.08839 / ISBN:https://doi.org/10.48550/arXiv.2509.08839 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site
- Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned / 2509.08852 / ISBN:https://doi.org/10.48550/arXiv.2509.08852 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site
- Enhancing Clinical Decision-Making: Integrating Multi-Agent Systems with Ethical AI Governance
/ 2504.03699 / ISBN:https://doi.org/10.48550/arXiv.2504.03699 / Published by ArXiv / Version released on 2025-09-22 / on (web) Publishing site
- Web3 x AI Agents: Landscape, Integrations, and Foundational Challenges / 2508.02773 / ISBN:https://doi.org/10.48550/arXiv.2508.02773 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site
- AI and the Future of Academic Peer Review
/ 2509.14189 / ISBN:https://doi.org/10.48550/arXiv.2509.14189 / Published by ArXiv / Version released on 2026-02-27 / on (web) Publishing site
- Understanding the Process of Human-AI Value Alignment / 2509.13854 / ISBN:https://doi.org/10.48550/arXiv.2509.13854 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site
- Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site
- TVS Sidekick: Challenges and Practical Insights from Deploying Large Language Models in the Enterprise / 2509.26482 / ISBN:https://doi.org/10.48550/arXiv.2509.26482 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site
- Reconsidering Requirements Engineering: Human-AI Collaboration in AI-Native Software Development / 2510.04380 / ISBN:https://doi.org/10.1007/978-3-032-04190-6_11 / Published by ArXiv / Version released on 2025-10-05 / on (web) Publishing site
- Building an Open AIBOM Standard in the Wild / 2510.07070 / ISBN:https://doi.org/10.48550/arXiv.2510.07070 / Published by ArXiv / Version released on 2026-02-22 / on (web) Publishing site
- Using Generative Artificial Intelligence Creatively in the Classroom and Research: Examples and Lessons Learned / 2409.05176 / ISBN:https://doi.org/10.48550/arXiv.2409.05176 / Published by ArXiv / Version released on 2025-10-24 / on (web) Publishing site
- Toward a Public and Secure Generative AI: A Comparative Analysis of Open and Closed LLMs / 2505.10603 / ISBN:https://doi.org/10.48550/arXiv.2505.10603 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site
- The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs
/ 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site
- AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site
- How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site
- Towards Human-AI Synergy in Requirements Engineering: A Framework and Preliminary Study / 2510.25016 / ISBN:https://doi.org/10.48550/arXiv.2510.25016 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site
- Diverse Human Value Alignment for Large Language Models via Ethical Reasoning / 2511.00379 / ISBN:https://doi.org/10.48550/arXiv.2511.00379 / Published by ArXiv / Version released on 2025-11-01 / on (web) Publishing site
- Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2026-01-23 / on (web) Publishing site
- Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- Knowing Ourselves Through Others: Reflecting with AI in Digital Human Debates / 2511.13046 / ISBN:https://doi.org/10.48550/arXiv.2511.13046 / Published by ArXiv / Version released on 2025-11-17 / on (web) Publishing site
- Person-AI Bidirectional Fit - A Proof-Of-Concept Case Study Of Augmented Human-Ai Symbiosis In Management Decision-Making Process / 2511.13670 / ISBN:https://doi.org/10.48550/arXiv.2511.13670 / Published by ArXiv / Version released on 2025-11-17 / on (web) Publishing site
- Cross-cultural value alignment frameworks for responsible AI governance: Evidence from China-West comparative analysis / 2511.17256 / ISBN:https://doi.org/10.48550/arXiv.2511.17256 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- Towards Synergistic Teacher-AI Interactions with Generative Artificial Intelligence / 2511.19580 / ISBN:https://doi.org/10.48550/arXiv.2511.19580 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site
- Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development / 2511.20623 / ISBN:https://doi.org/10.48550/arXiv.2511.20623 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site
- Morality in AI. A plea to embed morality in LLM architectures and frameworks / 2511.20689 / ISBN:https://doi.org/10.48550/arXiv.2511.20689 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site
- From Prediction to Foresight: The Role of AI in Designing Responsible Futures / 2511.21570 / ISBN:https://doi.org/10.48550/arXiv.2511.21570 / Published by ArXiv / Version released on 2025-11-26 / on (web) Publishing site
- Enabling Ethical AI: A case study in using Ontological Context for Justified Agentic AI Decisions / 2512.04822 / ISBN:https://doi.org/10.48550/arXiv.2512.04822 / Published by ArXiv / Version released on 2025-12-04 / on (web) Publishing site
- Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research / 2512.10058 / ISBN:https://doi.org/10.48550/arXiv.2512.10058 / Published by ArXiv / Version released on 2025-12-10 / on (web) Publishing site
- Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research / 2412.04497 / ISBN:https://doi.org/10.48550/arXiv.2412.04497 / Published by ArXiv / Version released on 2026-04-17 / on (web) Publishing site
- Ethics Practices in AI Development: An Empirical Study Across Roles and Regions / 2508.09219 / ISBN:https://doi.org/10.48550/arXiv.2508.09219 / Published by ArXiv / Version released on 2025-12-13 / on (web) Publishing site
- AI Sprints: Towards a Critical Method for Human-AI Collaboration
/ 2512.12371 / ISBN:https://doi.org/10.48550/arXiv.2512.12371 / Published by ArXiv / Version released on 2025-12-13 / on (web) Publishing site
- SafeGen: Embedding Ethical Safeguards in Text-to-Image Generation / 2512.12501 / ISBN:https://doi.org/10.48550/arXiv.2512.12501 / Published by ArXiv / Version released on 2025-12-14 / on (web) Publishing site
- Evaluation of AI Ethics Tools in Language Models: A Developers' Perspective Case Stud / 2512.15791 / ISBN:https://doi.org/10.48550/arXiv.2512.15791 / Published by ArXiv / Version released on 2025-12-16 / on (web) Publishing site
- Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site
- Semantic Alignment Between Normative Theories of Ethics and the European Union Artificial Intelligence Act: A Transformer-Based Semantic Textual Similarity Analysis
/ 2601.13372 / ISBN:https://doi.org/10.48550/arXiv.2601.13372 / Published by ArXiv / Version released on 2026-05-08 / on (web) Publishing site
- Epistemic Constitutionalism Or: how to avoid coherence bias / 2601.14295 / ISBN:https://doi.org/10.48550/arXiv.2601.14295 / Published by ArXiv / Version released on 2026-04-22 / on (web) Publishing site
- Reimagining Legal Fact Verification with GenAI: Toward Effective Human-AI Collaboration / 2602.06305 / ISBN:https://doi.org/10.48550/arXiv.2602.06305 / Published by ArXiv / Version released on 2026-02-09 / on (web) Publishing site
- Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM) / 2601.14298 / ISBN:https://doi.org/10.48550/arXiv.2601.14298 / Published by ArXiv / Version released on 2026-01-16 / on (web) Publishing site
- Unsupervised Elicitation of Moral Values from Language Models / 2601.17728 / ISBN:https://doi.org/10.48550/arXiv.2601.17728 / Published by ArXiv / Version released on 2026-01-25 / on (web) Publishing site
- Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site
- Trustworthy AI Software Engineers / 2602.06310 / ISBN:https://doi.org/10.48550/arXiv.2602.06310 / Published by ArXiv / Version released on 2026-02-06 / on (web) Publishing site
- Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site
- Can LLMs Synthesize Court-Ready Statistical Evidence? Evaluating AI-Assisted Sentencing Bias Analysis for California Racial Justice Act Claims / 2603.04804 / ISBN:https://doi.org/10.48550/arXiv.2603.04804 / Version released on 2026-03-05 / on (web) Publishing site
- Building the ethical AI framework of the future: from philosophy to practice
/ 2603.06599 / ISBN:https://doi.org/10.48550/arXiv.2603.06599 / Version released on 2026-02-16 / on (web) Publishing site
- Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Version released on 2026-03-23 / on (web) Publishing site
- Unilateral Relationship Revision Power in Human-AI Companion Interaction / 2603.23315 / ISBN:https://doi.org/10.48550/arXiv.2603.23315 / Published by ArXiv / Version released on 2026-04-28 / on (web) Publishing site
- The Landscape of Generative AI in Information Systems: A Synthesis of Secondary Reviews and Research Agendas / 2603.11842 / ISBN:https://doi.org/10.48550/arXiv.2603.11842 / Version released on 2026-03-12 / on (web) Publishing site
- Bridging the Gap in the Responsible AI Divides
/ 2603.14495 / ISBN:https://doi.org/10.48550/arXiv.2603.14495 / Version released on 2026-03-15 / on (web) Publishing site
- AI Integrity: A New Paradigm for Verifiable AI Governance / 2604.11065 / ISBN:https://doi.org/10.48550/arXiv.2604.11065 / Version released on 2026-04-13 / on (web) Publishing site
- Ethics Testing: Proactive Identification of Generative AI System Harms
/ 2604.22089 / ISBN:https://doi.org/10.48550/arXiv.2604.22089 / Version released on 2026-04-23 / on (web) Publishing site
- From Review to Design: Ethical Multimodal Driver Monitoring Systems for Risk Mitigation, Incident Response, and Accountability in Automated Vehicles / 2605.06439 / ISBN:https://doi.org/10.48550/arXiv.2605.06439 / Version released on 2026-05-07 / on (web) Publishing site
- Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Version released on 2026-05-07 / on (web) Publishing site
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey / 2505.00753 / ISBN:https://doi.org/10.48550/arXiv.2505.00753 / Version released on 2026-05-06 / on (web) Publishing site
- The Thin Line Between Comprehension and Persuasion in LLMs / 2507.01936 / ISBN:https://doi.org/10.48550/arXiv.2507.01936 / Version released on 2026-04-18 / on (web) Publishing site
- Co-Constructing Alignment: A Participatory Approach to Situate AI Values / 2601.15895 / ISBN:https://doi.org/10.48550/arXiv.2601.15895 / Version released on 2026-04-21 / on (web) Publishing site
- A Co-Evolutionary Theory of Human-AI Coexistence: Mutualism, Governance, and Dynamics in Complex Societies / 2604.22227 / ISBN:https://doi.org/10.48550/arXiv.2604.22227 / Version released on 2026-04-29 / on (web) Publishing site
_