_
RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome
for updates on publications, follow: on Instagram, Twitter, Patreon, YouTube


_

You are now here: AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13 > (tag cloud) >tag_selected: zheng


Currently searching for:

if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long


if you modify the keywords, press enter within the field to confirm the new search key

Tag: zheng

Bibliography items where occurs: 142
The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site


The AI Revolution: Opportunities and Challenges for the Finance Sector / 2308.16538 / ISBN:https://doi.org/10.48550/arXiv.2308.16538 / Published by ArXiv / Version released on 2023-08-31 / on (web) Publishing site


Pathway to Future Symbiotic Creativity / 2209.02388 / ISBN:https://doi.org/10.48550/arXiv.2209.02388 / Published by ArXiv / Version released on 2023-09-13 / on (web) Publishing site


The Cambridge Law Corpus: A Corpus for Legal AI Research / 2309.12269 / ISBN:https://doi.org/10.48550/arXiv.2309.12269 / Published by ArXiv / Version released on 2024-01-01 / on (web) Publishing site


Ethics of Artificial Intelligence and Robotics in the Architecture, Engineering, and Construction Industry / 2310.05414 / ISBN:https://doi.org/10.48550/arXiv.2310.05414 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site


The Self 2.0: How AI-Enhanced Self-Clones Transform Self-Perception and Improve Presentation Skills / 2310.15112 / ISBN:https://doi.org/10.48550/arXiv.2310.15112 / Published by ArXiv / Version released on 2023-10-23 / on (web) Publishing site


Synergizing Human-AI Agency: A Guide of 23 Heuristics for Service Co-Creation with LLM-Based Agents / 2310.15065 / ISBN:https://doi.org/10.48550/arXiv.2310.15065 / Published by ArXiv / Version released on 2023-11-29 / on (web) Publishing site


Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site


Control Risk for Potential Misuse of Artificial Intelligence in Science / 2312.06632 / ISBN:https://doi.org/10.48550/arXiv.2312.06632 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site


Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review / 2401.01519 / ISBN:https://doi.org/10.48550/arXiv.2401.01519 / Published by ArXiv / Version released on 2025-04-20 / on (web) Publishing site


Detecting Multimedia Generated by Large AI Models: A Survey / 2402.00045 / ISBN:https://doi.org/10.48550/arXiv.2402.00045 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site


POLARIS: A framework to guide the development of Trustworthy AI systems / 2402.05340 / ISBN:https://doi.org/10.48550/arXiv.2402.05340 / Published by ArXiv / Version released on 2024-02-08 / on (web) Publishing site


I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBench / 2401.17882 / ISBN:https://doi.org/10.48550/arXiv.2401.17882 / Published by ArXiv / Version released on 2024-02-16 / on (web) Publishing site


Mapping the Ethics of Generative AI: A Comprehensive Scoping Review / 2402.08323 / ISBN:https://doi.org/10.48550/arXiv.2402.08323 / Published by ArXiv / Version released on 2024-02-13 / on (web) Publishing site


User Modeling and User Profiling: A Comprehensive Survey / 2402.09660 / ISBN:https://doi.org/10.48550/arXiv.2402.09660 / Published by ArXiv / Version released on 2024-02-20 / on (web) Publishing site


A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site


AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site


Not a Swiss Army Knife: Academics' Perceptions of Trade-Offs Around Generative Artificial Intelligence Use / 2405.00995 / ISBN:https://doi.org/10.48550/arXiv.2405.00995 / Published by ArXiv / Version released on 2025-08-25 / on (web) Publishing site


A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law / 2405.01769 / ISBN:https://doi.org/10.48550/arXiv.2405.01769 / Published by ArXiv / Version released on 2024-11-21 / on (web) Publishing site


Current state of LLM Risks and AI Guardrails / 2406.12934 / ISBN:https://doi.org/10.48550/arXiv.2406.12934 / Published by ArXiv / Version released on 2024-06-16 / on (web) Publishing site


Artificial intelligence, rationalization, and the limits of control in the public sector: the case of tax policy optimization / 2407.05336 / ISBN:https://doi.org/10.48550/arXiv.2407.05336 / Published by ArXiv / Version released on 2024-07-07 / on (web) Publishing site


The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site


Data-Centric Foundation Models in Computational Healthcare: A Survey / 2401.02458 / ISBN:https://doi.org/10.48550/arXiv.2401.02458 / Published by ArXiv / Version released on 2026-04-29 / on (web) Publishing site


Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site


DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life / 2410.02683 / ISBN:https://doi.org/10.48550/arXiv.2410.02683 / Published by ArXiv / Version released on 2025-03-15 / on (web) Publishing site


Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models / 2410.12880 / ISBN:https://doi.org/10.48550/arXiv.2410.12880 / Published by ArXiv / Version released on 2025-01-24 / on (web) Publishing site


Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site


Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site


Web Scraping for Research: Legal, Ethical, Institutional, and Scientific Considerations / 2410.23432 / ISBN:https://doi.org/10.48550/arXiv.2410.23432 / Published by ArXiv / Version released on 2024-12-19 / on (web) Publishing site


A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions / 2406.03712 / ISBN:https://doi.org/10.48550/arXiv.2406.03712 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site


Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications / 2411.06837 / ISBN:https://doi.org/10.48550/arXiv.2411.06837 / Published by ArXiv / Version released on 2026-04-21 / on (web) Publishing site


Collaborative Participatory Research with LLM Agents in South Asia: An Empirically-Grounded Methodological Initiative and Agenda from Field Evidence in Sri Lanka / 2411.08294 / ISBN:https://doi.org/10.48550/arXiv.2411.08294 / Published by ArXiv / Version released on 2024-11-13 / on (web) Publishing site


Bias in Large Language Models: Origin, Evaluation, and Mitigation / 2411.10915 / ISBN:https://doi.org/10.48550/arXiv.2411.10915 / Published by ArXiv / Version released on 2026-05-01 / on (web) Publishing site


Clio: Privacy-Preserving Insights into Real-World AI Use / 2412.13678 / ISBN:https://doi.org/10.48550/arXiv.2412.13678 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site


FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing / 2502.03826 / ISBN:https://doi.org/10.48550/arXiv.2502.03826 / Published by ArXiv / Version released on 2025-08-15 / on (web) Publishing site


Cognitive AI framework 2.0: advances in the simulation of human thought / 2502.04259 / ISBN:https://doi.org/10.48550/arXiv.2502.04259 / Published by ArXiv / Version released on 2026-01-21 / on (web) Publishing site


Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site


Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site


On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review / 2502.14886 / ISBN:https://doi.org/10.48550/arXiv.2502.14886 / Published by ArXiv / Version released on 2025-11-03 / on (web) Publishing site


Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives / 2502.16841 / ISBN:https://doi.org/10.48550/arXiv.2502.16841 / Published by ArXiv / Version released on 2026-01-14 / on (web) Publishing site


Medical Hallucinations in Foundation Models and Their Impact on Healthcare / 2503.05777 / ISBN:https://doi.org/10.48550/arXiv.2503.05777 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site


AI Governance InternationaL Evaluation Index (AGILE Index) 2024 / 2502.15859 / ISBN:https://doi.org/10.48550/arXiv.2502.15859 / Published by ArXiv / Version released on 2025-07-17 / on (web) Publishing site


BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models / 2503.24310 / ISBN:https://doi.org/10.48550/arXiv.2503.24310 / Published by ArXiv / Version released on 2025-03-31 / on (web) Publishing site


Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents / 2504.01029 / ISBN:https://doi.org/10.48550/arXiv.2504.01029 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site


A Framework for Developing University Policies on Generative AI Governance: A Cross-national Comparative Study / 2504.02636 / ISBN:https://doi.org/10.48550/arXiv.2504.02636 / Published by ArXiv / Version released on 2025-11-18 / on (web) Publishing site


Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation / 2502.05151 / ISBN:https://doi.org/10.48550/arXiv.2502.05151 / Published by ArXiv / Version released on 2026-03-05 / on (web) Publishing site


Framework, Standards, Applications and Best practices of Responsible AI : A Comprehensive Survey / 2504.13979 / ISBN:https://doi.org/10.48550/arXiv.2504.13979 / Published by ArXiv / Version released on 2025-04-18 / on (web) Publishing site


Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions / 2504.15236 / ISBN:https://doi.org/10.48550/arXiv.2504.15236 / Published by ArXiv / Version released on 2025-04-21 / on (web) Publishing site


TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models / 2504.20605 / ISBN:https://doi.org/10.48550/arXiv.2504.20605 / Published by ArXiv / Version released on 2026-05-02 / on (web) Publishing site


WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models / 2505.09595 / ISBN:https://doi.org/10.48550/arXiv.2505.09595 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site


From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery / 2505.13259 / ISBN:https://doi.org/10.48550/arXiv.2505.13259 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site


AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals / 2505.15365 / ISBN:https://doi.org/10.48550/arXiv.2505.15365 / Published by ArXiv / Version released on 2025-05-21 / on (web) Publishing site


Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents / 2505.23559 / ISBN:https://doi.org/10.48550/arXiv.2505.23559 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site


Locating Risk: Task Designers and the Challenge of Risk Disclosure in RAI Content Work / 2505.24246 / ISBN:https://doi.org/10.48550/arXiv.2505.24246 / Published by ArXiv / Version released on 2026-03-31 / on (web) Publishing site


A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site


Mechanistic Interpretability Needs Philosophy / 2506.18852 / ISBN:https://doi.org/10.48550/arXiv.2506.18852 / Published by ArXiv / Version released on 2025-06-23 / on (web) Publishing site


Towards the Digital Me: A vision of authentic Conversational Agents powered by personal Human Digital Twins / 2506.23826 / ISBN:https://doi.org/10.48550/arXiv.2506.23826 / Published by ArXiv / Version released on 2025-06-30 / on (web) Publishing site


Exploring Collaboration Patterns and Strategies in Human-AI Co-creation through the Lens of Agency: A Scoping Review of the Top-tier HCI Literature / 2507.06000 / ISBN:https://doi.org/10.48550/arXiv.2507.06000 / Published by ArXiv / Version released on 2025-09-26 / on (web) Publishing site


Model Cards Revisited: Bridging the Gap Between Theory and Practice for Ethical AI Requirements / 2507.06014 / ISBN:https://doi.org/10.48550/arXiv.2507.06014 / Published by ArXiv / Version released on 2025-07-08 / on (web) Publishing site


When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site


The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist / 2507.11810 / ISBN:https://doi.org/10.48550/arXiv.2507.11810 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


Exploiting Jailbreaking Vulnerabilities in Generative AI to Bypass Ethical Safeguards for Facilitating Phishing Attacks / 2507.12185 / ISBN:https://doi.org/10.48550/arXiv.2507.12185 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


Culling Misinformation from Gen AI: Toward Ethical Curation and Refinement / 2507.14242 / ISBN:https://doi.org/10.48550/arXiv.2507.14242 / Published by ArXiv / Version released on 2025-07-17 / on (web) Publishing site


Challenges of Trustworthy Federated Learning: What's Done, Current Trends and Remaining Work / 2507.15796 / ISBN:https://doi.org/10.48550/arXiv.2507.15796 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site


Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation / 2507.15901 / ISBN:https://doi.org/10.48550/arXiv.2507.15901 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


PRAC3 (Privacy, Reputation, Accountability, Consent, Credit, Compensation): Long Tailed Risks of Voice Actors in AI Data-Economy / 2507.16247 / ISBN:https://doi.org/10.48550/arXiv.2507.16247 / Published by ArXiv / Version released on 2025-07-22 / on (web) Publishing site


Defining ethically sourced code generation / 2507.19743 / ISBN:https://doi.org/10.48550/arXiv.2507.19743 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site


EthicAlly: a Prototype for AI-Powered Research Ethics Support for the Social Sciences and Humanities / 2508.00856 / ISBN:https://doi.org/10.48550/arXiv.2508.00856 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site


Development of management systems using artificial intelligence systems and machine learning methods for boards of directors (preprint, unofficial translation) / 2508.03769 / ISBN:https://doi.org/10.48550/arXiv.2508.03769 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site


Data and AI governance: Promoting equity, ethics, and fairness in large language models / 2508.03970 / ISBN:https://doi.org/10.48550/arXiv.2508.03970 / Published by ArXiv / Version released on 2025-08-05 / on (web) Publishing site


PrinciplismQA: A Philosophy-Grounded Approach to Assessing LLM-Human Clinical Medical Ethics Alignment / 2508.05132 / ISBN:https://doi.org/10.48550/arXiv.2508.05132 / Published by ArXiv / Version released on 2026-04-20 / on (web) Publishing site


A Methodological Framework and Questionnaire for Investigating Perceived Algorithmic Fairness / 2508.05281 / ISBN:https://doi.org/10.48550/arXiv.2508.05281 / Published by ArXiv / Version released on 2025-08-07 / on (web) Publishing site


The Fair Game: Auditing & Debiasing AI Algorithms Over Time / 2508.06443 / ISBN:https://doi.org/10.48550/arXiv.2508.06443 / Published by ArXiv / Version released on 2025-08-08 / on (web) Publishing site


A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


A Comprehensive Review of Datasets for Clinical Mental Health AI Systems / 2508.09809 / ISBN:https://doi.org/10.48550/arXiv.2508.09809 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond / 2508.11957 / ISBN:https://doi.org/10.48550/arXiv.2508.11957 / Published by ArXiv / Version released on 2025-08-16 / on (web) Publishing site


Design and Validation of a Responsible Artificial Intelligence-based System for the Referral of Diabetic Retinopathy Patients / 2508.12506 / ISBN:https://doi.org/10.48550/arXiv.2508.12506 / Published by ArXiv / Version released on 2025-08-17 / on (web) Publishing site


When AI Writes Back: Ethical Considerations by Physicians on AI-Drafted Patient Message Replies / 2508.13217 / ISBN:https://doi.org/10.48550/arXiv.2508.13217 / Published by ArXiv / Version released on 2025-08-17 / on (web) Publishing site


The Agent Behavior: Model, Governance and Challenges in the AI Digital Age / 2508.14415 / ISBN:https://doi.org/10.48550/arXiv.2508.14415 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site


Do Students Rely on AI? Analysis of Student-ChatGPT Conversations from a Field Study / 2508.20244 / ISBN:https://doi.org/10.48550/arXiv.2508.20244 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science / 2508.20674 / ISBN:https://doi.org/10.48550/arXiv.2508.20674 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site


Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI / 2508.21101 / ISBN:https://doi.org/10.48550/arXiv.2508.21101 / Published by ArXiv / Version released on 2025-08-28 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code / 2509.07006 / ISBN:https://doi.org/10.48550/arXiv.2509.07006 / Published by ArXiv / Version released on 2025-09-06 / on (web) Publishing site


Enhancing Clinical Decision-Making: Integrating Multi-Agent Systems with Ethical AI Governance / 2504.03699 / ISBN:https://doi.org/10.48550/arXiv.2504.03699 / Published by ArXiv / Version released on 2025-09-22 / on (web) Publishing site


Web3 x AI Agents: Landscape, Integrations, and Foundational Challenges / 2508.02773 / ISBN:https://doi.org/10.48550/arXiv.2508.02773 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


AI For Privacy in Smart Homes: Exploring How Leveraging AI-Powered Smart Devices Enhances Privacy Protection / 2509.14050 / ISBN:https://doi.org/10.48550/arXiv.2509.14050 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site


Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site


Perceptions of AI Across Sectors: A Comparative Review of Public Attitudes / 2509.18233 / ISBN:https://doi.org/10.48550/arXiv.2509.18233 / Published by ArXiv / Version released on 2025-09-22 / on (web) Publishing site


TVS Sidekick: Challenges and Practical Insights from Deploying Large Language Models in the Enterprise / 2509.26482 / ISBN:https://doi.org/10.48550/arXiv.2509.26482 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications / 2407.18745 / ISBN:https://doi.org/10.48550/arXiv.2407.18745 / Published by ArXiv / Version released on 2025-11-02 / on (web) Publishing site


Fully Autonomous AI Agents Should Not be Developed / 2502.02649 / ISBN:https://doi.org/10.48550/arXiv.2502.02649 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site


The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs / 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


Making Power Explicable in AI: Analyzing, Understanding, and Redirecting Power to Operationalize Ethics in AI Technical Practice / 2510.10588 / ISBN:https://doi.org/10.48550/arXiv.2510.10588 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


Integration of AI in STEM Education, Addressing Ethical Challenges in K-12 Settings / 2510.19196 / ISBN:https://doi.org/10.48550/arXiv.2510.19196 / Published by ArXiv / Version released on 2025-10-22 / on (web) Publishing site


How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site


Diverse Human Value Alignment for Large Language Models via Ethical Reasoning / 2511.00379 / ISBN:https://doi.org/10.48550/arXiv.2511.00379 / Published by ArXiv / Version released on 2025-11-01 / on (web) Publishing site


When Machines Join the Moral Circle: The Persona Effect of Generative AI Agents in Collaborative Reasoning / 2511.01205 / ISBN:https://doi.org/10.48550/arXiv.2511.01205 / Published by ArXiv / Version released on 2026-03-22 / on (web) Publishing site


Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2026-01-23 / on (web) Publishing site


Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning / 2511.07682 / ISBN:https://doi.org/10.48550/arXiv.2511.07682 / Published by ArXiv / Version released on 2025-11-10 / on (web) Publishing site


BeautyGuard: Designing a Multi-Agent Roundtable System for Proactive Beauty Tech Compliance through Stakeholder Collaboration / 2511.12645 / ISBN:https://doi.org/10.48550/arXiv.2511.12645 / Published by ArXiv / Version released on 2025-11-18 / on (web) Publishing site


Navigating the Ethical and Societal Impacts of Generative AI in Higher Computing Education / 2511.15768 / ISBN:https://doi.org/10.48550/arXiv.2511.15768 / Published by ArXiv / Version released on 2026-02-16 / on (web) Publishing site


Cross-cultural value alignment frameworks for responsible AI governance: Evidence from China-West comparative analysis / 2511.17256 / ISBN:https://doi.org/10.48550/arXiv.2511.17256 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site


Morality in AI. A plea to embed morality in LLM architectures and frameworks / 2511.20689 / ISBN:https://doi.org/10.48550/arXiv.2511.20689 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site


A Brief History of Digital Twin Technology / 2511.20695 / ISBN:https://doi.org/10.48550/arXiv.2511.20695 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site


The Essentials of AI for Life and Society: A Full-Scale AI Literacy Course Accessible to All / 2512.04110 / ISBN:https://doi.org/10.48550/arXiv.2512.04110 / Published by ArXiv / Version released on 2025-11-30 / on (web) Publishing site


Human-controllable AI: Meaningful Human Control / 2512.04334 / ISBN:https://doi.org/10.48550/arXiv.2512.04334 / Published by ArXiv / Version released on 2026-02-19 / on (web) Publishing site


Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research / 2412.04497 / ISBN:https://doi.org/10.48550/arXiv.2412.04497 / Published by ArXiv / Version released on 2026-04-17 / on (web) Publishing site


PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI / 2512.24848 / ISBN:https://doi.org/10.48550/arXiv.2512.24848 / Published by ArXiv / Version released on 2025-12-31 / on (web) Publishing site


Unseen Risks of Clinical Speech-to-Text Systems: Transparency, Privacy, and Reliability Challenges in AI-Driven Documentation / 2601.00382 / ISBN:https://doi.org/10.48550/arXiv.2601.00382 / Published by ArXiv / Version released on 2026-03-30 / on (web) Publishing site


Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site


Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models / 2502.07077 / ISBN:https://doi.org/10.48550/arXiv.2502.07077 / Published by ArXiv / Version released on 2026-02-02 / on (web) Publishing site


Academic journals' AI policies fail to curb the surge in AI-assisted academic writing / 2512.06705 / ISBN:https://doi.org/10.48550/arXiv.2512.06705 / Published by ArXiv / Version released on 2026-01-20 / on (web) Publishing site


Reimagining Legal Fact Verification with GenAI: Toward Effective Human-AI Collaboration / 2602.06305 / ISBN:https://doi.org/10.48550/arXiv.2602.06305 / Published by ArXiv / Version released on 2026-02-09 / on (web) Publishing site


AI Systems in Text-Based Online Counselling: Ethical Considerations Across Three Implementation Approaches / 2601.08878 / ISBN:https://doi.org/10.48550/arXiv.2601.08878 / Published by ArXiv / Version released on 2026-01-12 / on (web) Publishing site


Improving the Safety and Trustworthiness of Medical AI via Multi-Agent Evaluation Loops / 2601.13268 / ISBN:https://doi.org/10.48550/arXiv.2601.13268 / Published by ArXiv / Version released on 2026-01-19 / on (web) Publishing site


Conversational AI for Social Good (CAI4SG): An Overview of Emerging Trends, Applications, and Challenges / 2601.15136 / ISBN:https://doi.org/10.48550/arXiv.2601.15136 / Published by ArXiv / Version released on 2026-01-21 / on (web) Publishing site


Artificial Intelligence for Inclusive Engineering Education: Advancing Equality, Diversity, and Ethical Leadership / 2602.02520 / ISBN:https://doi.org/10.48550/arXiv.2602.02520 / Published by ArXiv / Version released on 2026-01-24 / on (web) Publishing site


Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site


Trustworthy AI Software Engineers / 2602.06310 / ISBN:https://doi.org/10.48550/arXiv.2602.06310 / Published by ArXiv / Version released on 2026-02-06 / on (web) Publishing site


Artificial Intelligence in Open Source Software Engineering: A Foundation for Sustainability / 2602.07071 / ISBN:https://doi.org/10.48550/arXiv.2602.07071 / Published by ArXiv / Version released on 2026-02-05 / on (web) Publishing site


Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site


Dark and Bright Side of Participatory Red-Teaming with Targets of Stereotyping for Eliciting Harmful Behaviors from Large Language Models / 2602.19124 / ISBN:https://doi.org/10.48550/arXiv.2602.19124 / Version released on 2026-02-22 / on (web) Publishing site


Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Version released on 2026-03-23 / on (web) Publishing site


COMPASS: The explainable agentic framework for Sovereignty, Sustainability, Compliance, and Ethics / 2603.11277 / ISBN:https://doi.org/10.48550/arXiv.2603.11277 / Version released on 2026-03-13 / on (web) Publishing site


Learning to Program Alongside AI: Critical Thinking, AI Ethics, and Gendered Patterns of German Secondary School Students / 2603.24197 / ISBN:https://doi.org/10.48550/arXiv.2603.24197 / Version released on 2026-03-27 / on (web) Publishing site


The Landscape of Generative AI in Information Systems: A Synthesis of Secondary Reviews and Research Agendas / 2603.11842 / ISBN:https://doi.org/10.48550/arXiv.2603.11842 / Version released on 2026-03-12 / on (web) Publishing site


Literary Narrative as Moral Probe : A Cross-System Framework for Evaluating AI Ethical Reasoning and Refusal Behavior / 2603.12615 / ISBN:https://doi.org/10.48550/arXiv.2603.12615 / Version released on 2026-03-13 / on (web) Publishing site


Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents / 2603.13743 / ISBN:https://doi.org/10.48550/arXiv.2603.13743 / Version released on 2026-03-14 / on (web) Publishing site


Bridging the Gap in the Responsible AI Divides / 2603.14495 / ISBN:https://doi.org/10.48550/arXiv.2603.14495 / Version released on 2026-03-15 / on (web) Publishing site


An ontological approach to foster the convergence, interoperability and operationalization of frameworks for Trustworthy AI / 2604.11033 / ISBN:https://doi.org/10.48550/arXiv.2604.11033 / Version released on 2026-04-13 / on (web) Publishing site


Strategic Polysemy in AI Discourse: A Philosophical Analysis of Language, Hype, and Power / 2604.21043 / ISBN:https://doi.org/10.48550/arXiv.2604.21043 / Version released on 2026-04-22 / on (web) Publishing site


Ethics Testing: Proactive Identification of Generative AI System Harms / 2604.22089 / ISBN:https://doi.org/10.48550/arXiv.2604.22089 / Version released on 2026-04-23 / on (web) Publishing site


Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Version released on 2026-05-07 / on (web) Publishing site


LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey / 2505.00753 / ISBN:https://doi.org/10.48550/arXiv.2505.00753 / Version released on 2026-05-06 / on (web) Publishing site


The Thin Line Between Comprehension and Persuasion in LLMs / 2507.01936 / ISBN:https://doi.org/10.48550/arXiv.2507.01936 / Version released on 2026-04-18 / on (web) Publishing site