RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome



AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13



Tag: anthropic

Bibliography items where the tag occurs: 119
The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward / 2308.14253 / ISBN:https://doi.org/10.48550/arXiv.2308.14253 / Published by ArXiv / Version released on 2023-08-28 / on (web) Publishing site


The Impact of Artificial Intelligence on the Evolution of Digital Education: A Comparative Study of OpenAI Text Generation Tools including ChatGPT, Bing Chat, Bard, and Ernie / 2309.02029 / ISBN:https://doi.org/10.48550/arXiv.2309.02029 / Published by ArXiv / Version released on 2023-09-05 / on (web) Publishing site


STREAM: Social data and knowledge collective intelligence platform for TRaining Ethical AI Models / 2310.05563 / ISBN:https://doi.org/10.48550/arXiv.2310.05563 / Published by ArXiv / Version released on 2023-10-09 / on (web) Publishing site


Specific versus General Principles for Constitutional AI / 2310.13798 / ISBN:https://doi.org/10.48550/arXiv.2310.13798 / Published by ArXiv / Version released on 2023-10-20 / on (web) Publishing site


Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing / 2304.02017 / ISBN:https://doi.org/10.48550/arXiv.2304.02017 / Published by ArXiv / Version released on 2024-08-03 / on (web) Publishing site


How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities / 2311.09447 / ISBN:https://doi.org/10.48550/arXiv.2311.09447 / Published by ArXiv / Version released on 2024-04-02 / on (web) Publishing site


The Rise of Creative Machines: Exploring the Impact of Generative AI / 2311.13262 / ISBN:https://doi.org/10.48550/arXiv.2311.13262 / Published by ArXiv / Version released on 2023-11-22 / on (web) Publishing site


Control Risk for Potential Misuse of Artificial Intelligence in Science / 2312.06632 / ISBN:https://doi.org/10.48550/arXiv.2312.06632 / Published by ArXiv / Version released on 2023-12-11 / on (web) Publishing site


Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review / 2401.01519 / ISBN:https://doi.org/10.48550/arXiv.2401.01519 / Published by ArXiv / Version released on 2025-04-20 / on (web) Publishing site


AI Ethics: A Bibliometric Analysis, Critical Issues, and Key Gaps / 2403.14681 / ISBN:https://doi.org/10.48550/arXiv.2403.14681 / Published by ArXiv / Version released on 2024-03-12 / on (web) Publishing site


AI Act and Large Language Models (LLMs): When critical issues and privacy impact require human and ethical oversight / 2404.00600 / ISBN:https://doi.org/10.48550/arXiv.2404.00600 / Published by ArXiv / Version released on 2024-04-02 / on (web) Publishing site


A Review of Multi-Modal Large Language and Vision Models / 2404.01322 / ISBN:https://doi.org/10.48550/arXiv.2404.01322 / Published by ArXiv / Version released on 2024-03-28 / on (web) Publishing site


Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents / 2404.06750 / ISBN:https://arxiv.org/abs/2404.06750 / Published by ArXiv / Version released on 2024-10-18 / on (web) Publishing site


AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site


The Necessity of AI Audit Standards Boards / 2404.13060 / ISBN:https://doi.org/10.48550/arXiv.2404.13060 / Published by ArXiv / Version released on 2024-04-11 / on (web) Publishing site


A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI / 2405.04333 / ISBN:https://doi.org/10.48550/arXiv.2405.04333 / Published by ArXiv / Version released on 2024-05-07 / on (web) Publishing site


Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback / 2404.10271 / ISBN:https://doi.org/10.48550/arXiv.2404.10271 / Published by ArXiv / Version released on 2024-06-04 / on (web) Publishing site


The Future of Child Development in the AI Era. Cross-Disciplinary Perspectives Between AI and Child Development Experts / 2405.19275 / ISBN:https://doi.org/10.48550/arXiv.2405.19275 / Published by ArXiv / Version released on 2024-05-29 / on (web) Publishing site


The AI Alignment Paradox / 2405.20806 / ISBN:https://doi.org/10.48550/arXiv.2405.20806 / Published by ArXiv / Version released on 2024-11-22 / on (web) Publishing site


Current state of LLM Risks and AI Guardrails / 2406.12934 / ISBN:https://doi.org/10.48550/arXiv.2406.12934 / Published by ArXiv / Version released on 2024-06-16 / on (web) Publishing site


Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health / 2406.13659 / ISBN:https://doi.org/10.48550/arXiv.2406.13659 / Published by ArXiv / Version released on 2024-06-19 / on (web) Publishing site


AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations / 2406.18346 / ISBN:https://doi.org/10.48550/arXiv.2406.18346 / Published by ArXiv / Version released on 2024-06-26 / on (web) Publishing site


Bridging the Global Divide in AI Regulation: A Proposal for a Contextual, Coherent, and Commensurable Framework / 2303.11196 / ISBN:https://doi.org/10.48550/arXiv.2303.11196 / Published by ArXiv / Version released on 2024-07-15 / on (web) Publishing site


Have We Reached AGI? Comparing ChatGPT, Claude, and Gemini to Human Literacy and Education Benchmarks / 2407.09573 / ISBN:https://doi.org/10.48550/arXiv.2407.09573 / Published by ArXiv / Version released on 2024-07-11 / on (web) Publishing site


Prioritizing High-Consequence Biological Capabilities in Evaluations of Artificial Intelligence Models / 2407.13059 / ISBN:https://doi.org/10.48550/arXiv.2407.13059 / Published by ArXiv / Version released on 2024-07-23 / on (web) Publishing site


Assurance of AI Systems From a Dependability Perspective / 2407.13948 / ISBN:https://doi.org/10.48550/arXiv.2407.13948 / Published by ArXiv / Version released on 2024-08-07 / on (web) Publishing site


Mapping the individual, social, and biospheric impacts of Foundation Models / 2407.17129 / ISBN:https://doi.org/10.48550/arXiv.2407.17129 / Published by ArXiv / Version released on 2024-07-24 / on (web) Publishing site


Surveys Considered Harmful? Reflecting on the Use of Surveys in AI Research, Development, and Governance / 2408.01458 / ISBN:https://doi.org/10.48550/arXiv.2408.01458 / Published by ArXiv / Version released on 2024-07-26 / on (web) Publishing site


Between Copyright and Computer Science: The Law and Ethics of Generative AI / 2403.14653 / ISBN:https://doi.org/10.48550/arXiv.2403.14653 / Published by ArXiv / Version released on 2024-09-05 / on (web) Publishing site


The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources / 2406.16746 / ISBN:https://doi.org/10.48550/arXiv.2406.16746 / Published by ArXiv / Version released on 2024-09-03 / on (web) Publishing site


DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection / 2409.06072 / ISBN:https://doi.org/10.48550/arXiv.2409.06072 / Published by ArXiv / Version released on 2024-09-09 / on (web) Publishing site


Large language models as linguistic simulators and cognitive models in human research / 2402.04470 / ISBN:https://doi.org/10.48550/arXiv.2402.04470 / Published by ArXiv / Version released on 2024-10-20 / on (web) Publishing site


Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure / 2409.19104 / ISBN:https://doi.org/10.48550/arXiv.2409.19104 / Published by ArXiv / Version released on 2024-09-27 / on (web) Publishing site


DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life / 2410.02683 / ISBN:https://doi.org/10.48550/arXiv.2410.02683 / Published by ArXiv / Version released on 2025-03-15 / on (web) Publishing site


AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models / 2410.07561 / ISBN:https://doi.org/10.48550/arXiv.2410.07561 / Published by ArXiv / Version released on 2024-12-12 / on (web) Publishing site


How Do AI Companies Fine-Tune Policy? Examining Regulatory Capture in AI Governance / 2410.13042 / ISBN:https://doi.org/10.48550/arXiv.2410.13042 / Published by ArXiv / Version released on 2024-10-16 / on (web) Publishing site


Data Defenses Against Large Language Models / 2410.13138 / ISBN:https://doi.org/10.48550/arXiv.2410.13138 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site


Demystifying Large Language Models for Medicine: A Primer / 2410.18856 / ISBN:https://doi.org/10.48550/arXiv.2410.18856 / Published by ArXiv / Version released on 2024-11-20 / on (web) Publishing site


Moral Agency in Silico: Exploring Free Will in Large Language Models / 2410.23310 / ISBN:https://doi.org/10.48550/arXiv.2410.23310 / Published by ArXiv / Version released on 2024-10-29 / on (web) Publishing site


Large-scale moral machine experiment on large language models / 2411.06790 / ISBN:https://doi.org/10.48550/arXiv.2411.06790 / Published by ArXiv / Version released on 2024-12-30 / on (web) Publishing site


Persuasion with Large Language Models: a Survey / 2411.06837 / ISBN:https://doi.org/10.48550/arXiv.2411.06837 / Published by ArXiv / Version released on 2024-11-11 / on (web) Publishing site


Chat Bankman-Fried: an Exploration of LLM Alignment in Finance / 2411.11853 / ISBN:https://doi.org/10.48550/arXiv.2411.11853 / Published by ArXiv / Version released on 2024-11-21 / on (web) Publishing site


Towards a Practical Ethics of Generative AI in Creative Production Processes / 2412.03579 / ISBN:https://doi.org/10.48550/arXiv.2412.03579 / Published by ArXiv / Version released on 2024-11-18 / on (web) Publishing site


Shaping AI's Impact on Billions of Lives / 2412.02730 / ISBN:https://doi.org/10.48550/arXiv.2412.02730 / Published by ArXiv / Version released on 2024-12-11 / on (web) Publishing site


Clio: Privacy-Preserving Insights into Real-World AI Use / 2412.13678 / ISBN:https://doi.org/10.48550/arXiv.2412.13678 / Published by ArXiv / Version released on 2024-12-18 / on (web) Publishing site


Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site


Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto / 2312.01818 / ISBN:https://doi.org/10.48550/arXiv.2312.01818 / Published by ArXiv / Version released on 2025-01-16 / on (web) Publishing site


Development of Application-Specific Large Language Models to Facilitate Research Ethics Review / 2501.10741 / ISBN:https://doi.org/10.48550/arXiv.2501.10741 / Published by ArXiv / Version released on 2025-01-18 / on (web) Publishing site


FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing / 2502.03826 / ISBN:https://doi.org/10.48550/arXiv.2502.03826 / Published by ArXiv / Version released on 2025-08-15 / on (web) Publishing site


Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2025-08-02 / on (web) Publishing site


A Conceptual Exploration of Generative AI-Induced Cognitive Dissonance and its Emergence in University-Level Academic Writing / 2502.05698 / ISBN:https://doi.org/10.48550/arXiv.2502.05698 / Published by ArXiv / Version released on 2025-02-08 / on (web) Publishing site


Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site


On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


An LLM-based Delphi Study to Predict GenAI Evolution / 2502.21092 / ISBN:https://doi.org/10.48550/arXiv.2502.21092 / Published by ArXiv / Version released on 2025-02-28 / on (web) Publishing site


Medical Hallucinations in Foundation Models and Their Impact on Healthcare / 2503.05777 / ISBN:https://doi.org/10.48550/arXiv.2503.05777 / Published by ArXiv / Version released on 2025-02-26 / on (web) Publishing site


MinorBench: A hand-built benchmark for content-based risks for children / 2503.10242 / ISBN:https://doi.org/10.48550/arXiv.2503.10242 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site


DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site


Policy Frameworks for Transparent Chain-of-Thought Reasoning in Large Language Models / 2503.14521 / ISBN:https://doi.org/10.48550/arXiv.2503.14521 / Published by ArXiv / Version released on 2025-03-14 / on (web) Publishing site


A Peek Behind the Curtain: Using Step-Around Prompt Engineering to Identify Bias and Misinformation in GenAI Models / 2503.15205 / ISBN:https://doi.org/10.48550/arXiv.2503.15205 / Published by ArXiv / Version released on 2025-03-19 / on (web) Publishing site


BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models / 2503.24310 / ISBN:https://doi.org/10.48550/arXiv.2503.24310 / Published by ArXiv / Version released on 2025-03-31 / on (web) Publishing site


Towards interactive evaluations for interaction harms in human-AI systems / 2405.10632 / ISBN:https://doi.org/10.48550/arXiv.2405.10632 / Published by ArXiv / Version released on 2025-07-30 / on (web) Publishing site


Who is Responsible? The Data, Models, Users or Regulations? A Comprehensive Survey on Responsible Generative AI for a Sustainable Future / 2502.08650 / ISBN:https://doi.org/10.48550/arXiv.2502.08650 / Published by ArXiv / Version released on 2025-04-28 / on (web) Publishing site


Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions / 2504.15236 / ISBN:https://doi.org/10.48550/arXiv.2504.15236 / Published by ArXiv / Version released on 2025-04-21 / on (web) Publishing site


Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room / 2504.16148 / ISBN:https://doi.org/10.48550/arXiv.2504.16148 / Published by ArXiv / Version released on 2025-04-22 / on (web) Publishing site


Auditing the Ethical Logic of Generative AI Models / 2504.17544 / ISBN:https://doi.org/10.48550/arXiv.2504.17544 / Published by ArXiv / Version released on 2025-04-24 / on (web) Publishing site


The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach / 2504.19255 / ISBN:https://doi.org/10.48550/arXiv.2504.19255 / Published by ArXiv / Version released on 2025-04-27 / on (web) Publishing site


GenAI in Entrepreneurship: a systematic review of generative artificial intelligence in entrepreneurship research: current issues and future directions / 2505.05523 / ISBN:https://doi.org/10.48550/arXiv.2505.05523 / Published by ArXiv / Version released on 2025-05-08 / on (web) Publishing site


Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach / 2505.09576 / ISBN:https://doi.org/10.48550/arXiv.2505.09576 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site


WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models / 2505.09595 / ISBN:https://doi.org/10.48550/arXiv.2505.09595 / Published by ArXiv / Version released on 2025-05-14 / on (web) Publishing site


AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals / 2505.15365 / ISBN:https://doi.org/10.48550/arXiv.2505.15365 / Published by ArXiv / Version released on 2025-05-21 / on (web) Publishing site


Making Sense of the Unsensible: Reflection, Survey, and Challenges for XAI in Large Language Models Toward Human-Centered AI / 2505.20305 / ISBN:https://doi.org/10.48550/arXiv.2505.20305 / Published by ArXiv / Version released on 2025-05-18 / on (web) Publishing site


Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety / 2506.00415 / ISBN:https://doi.org/10.48550/arXiv.2506.00415 / Published by ArXiv / Version released on 2025-05-31 / on (web) Publishing site


DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models / 2506.01257 / ISBN:https://doi.org/10.48550/arXiv.2506.01257 / Published by ArXiv / Version released on 2025-06-02 / on (web) Publishing site


Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe? / 2506.11945 / ISBN:https://doi.org/10.48550/arXiv.2506.11945 / Published by ArXiv / Version released on 2025-06-13 / on (web) Publishing site


A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site


Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs / 2506.13082 / ISBN:https://doi.org/10.48550/arXiv.2506.13082 / Published by ArXiv / Version released on 2025-10-06 / on (web) Publishing site


AI based Content Creation and Product Recommendation Applications in E-commerce: An Ethical overview / 2506.17370 / ISBN:https://doi.org/10.48550/arXiv.2506.17370 / Published by ArXiv / Version released on 2025-06-20 / on (web) Publishing site


Mechanistic Interpretability Needs Philosophy / 2506.18852 / ISBN:https://doi.org/10.48550/arXiv.2506.18852 / Published by ArXiv / Version released on 2025-06-23 / on (web) Publishing site


On the Surprising Efficacy of LLMs for Penetration-Testing / 2507.00829 / ISBN:https://doi.org/10.48550/arXiv.2507.00829 / Published by ArXiv / Version released on 2025-07-01 / on (web) Publishing site


Moral Responsibility or Obedience: What Do We Want from AI? / 2507.02788 / ISBN:https://doi.org/10.48550/arXiv.2507.02788 / Published by ArXiv / Version released on 2025-07-03 / on (web) Publishing site


Redefining Elderly Care with Agentic AI: Challenges and Opportunities / 2507.14912 / ISBN:https://doi.org/10.48550/arXiv.2507.14912 / Published by ArXiv / Version released on 2025-07-20 / on (web) Publishing site


ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site


Defining ethically sourced code generation / 2507.19743 / ISBN:https://doi.org/10.48550/arXiv.2507.19743 / Published by ArXiv / Version released on 2025-07-26 / on (web) Publishing site


EthicAlly: a Prototype for AI-Powered Research Ethics Support for the Social Sciences and Humanities / 2508.00856 / ISBN:https://doi.org/10.48550/arXiv.2508.00856 / Published by ArXiv / Version released on 2025-07-15 / on (web) Publishing site


Towards Assessing Medical Ethics from Knowledge to Practice / 2508.05132 / ISBN:https://doi.org/10.48550/arXiv.2508.05132 / Published by ArXiv / Version released on 2025-08-07 / on (web) Publishing site


A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond / 2508.11957 / ISBN:https://doi.org/10.48550/arXiv.2508.11957 / Published by ArXiv / Version released on 2025-08-16 / on (web) Publishing site


CAI Fluency: A Framework for Cybersecurity AI Fluency / 2508.13588 / ISBN:https://doi.org/10.48550/arXiv.2508.13588 / Published by ArXiv / Version released on 2025-10-07 / on (web) Publishing site


The AI-Fraud Diamond: A Novel Lens for Auditing Algorithmic Deception / 2508.13984 / ISBN:https://doi.org/10.48550/arXiv.2508.13984 / Published by ArXiv / Version released on 2025-08-19 / on (web) Publishing site


AI as IA: The use and abuse of artificial intelligence (AI) for human enhancement through intellectual augmentation (IA) / 2508.16642 / ISBN:https://doi.org/10.48550/arXiv.2508.16642 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


AI-Powered Legal Intelligence System Architecture: A Comprehensive Framework for Automated Legal Consultation and Analysis / 2508.17499 / ISBN:https://doi.org/10.48550/arXiv.2508.17499 / Published by ArXiv / Version released on 2025-08-24 / on (web) Publishing site


Do Students Rely on AI? Analysis of Student-ChatGPT Conversations from a Field Study / 2508.20244 / ISBN:https://doi.org/10.48550/arXiv.2508.20244 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


A Study on the Framework for Evaluating the Ethics and Trustworthiness of Generative AI / 2509.00398 / ISBN:https://doi.org/10.48550/arXiv.2509.00398 / Published by ArXiv / Version released on 2025-10-28 / on (web) Publishing site


Designing LMS and Instructional Strategies for Integrating Generative-Conversational AI / 2509.00709 / ISBN:https://doi.org/10.48550/arXiv.2509.00709 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


AI Governance in Higher Education: A course design exploring regulatory, ethical and practical considerations / 2509.06176 / ISBN:https://doi.org/10.48550/arXiv.2509.06176 / Published by ArXiv / Version released on 2025-09-16 / on (web) Publishing site


Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures / 2509.08839 / ISBN:https://doi.org/10.48550/arXiv.2509.08839 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site


Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned / 2509.08852 / ISBN:https://doi.org/10.48550/arXiv.2509.08852 / Published by ArXiv / Version released on 2025-09-08 / on (web) Publishing site


Web3 x AI Agents: Landscape, Integrations, and Foundational Challenges / 2508.02773 / ISBN:https://doi.org/10.48550/arXiv.2508.02773 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


AI and the Future of Academic Peer Review / 2509.14189 / ISBN:https://doi.org/10.48550/arXiv.2509.14189 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site


AI For Privacy in Smart Homes: Exploring How Leveraging AI-Powered Smart Devices Enhances Privacy Protection / 2509.14050 / ISBN:https://doi.org/10.48550/arXiv.2509.14050 / Published by ArXiv / Version released on 2025-09-17 / on (web) Publishing site


AI Adoption Across Mission-Driven Organizations / 2510.03868 / ISBN:https://doi.org/10.48550/arXiv.2510.03868 / Published by ArXiv / Version released on 2025-10-04 / on (web) Publishing site


Fully Autonomous AI Agents Should Not be Developed / 2502.02649 / ISBN:https://doi.org/10.48550/arXiv.2502.02649 / Published by ArXiv / Version released on 2025-10-20 / on (web) Publishing site


Toward a Public and Secure Generative AI: A Comparative Analysis of Open and Closed LLMs / 2505.10603 / ISBN:https://doi.org/10.48550/arXiv.2505.10603 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site


SciSciGPT: Advancing Human-AI Collaboration in the Science of Science / 2504.05559 / ISBN:https://doi.org/10.48550/arXiv.2504.05559 / Published by ArXiv / Version released on 2025-11-27 / on (web) Publishing site


Hiding in the AI Traffic: Abusing MCP for LLM-Powered Agentic Red Teaming / 2511.15998 / ISBN:https://doi.org/10.48550/arXiv.2511.15998 / Published by ArXiv / Version released on 2025-11-21 / on (web) Publishing site


Significant Other AI: Identity, Memory, and Emotional Regulation as Long-Term Relational Intelligence / 2512.00418 / ISBN:https://doi.org/10.48550/arXiv.2512.00418 / Published by ArXiv / Version released on 2025-12-07 / on (web) Publishing site


Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development / 2511.20623 / ISBN:https://doi.org/10.48550/arXiv.2511.20623 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site


Who Owns the Knowledge? Copyright, GenAI, and the Future of Academic Publishing / 2511.21755 / ISBN:https://doi.org/10.48550/arXiv.2511.21755 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site


A Human-centric Framework for Debating the Ethics of AI Consciousness Under Uncertainty / 2512.02544 / ISBN:https://doi.org/10.48550/arXiv.2512.02544 / Published by ArXiv / Version released on 2025-12-02 / on (web) Publishing site


The Decision Path to Control AI Risks Completely: Fundamental Control Mechanisms for AI Governance / 2512.04489 / ISBN:https://doi.org/10.48550/arXiv.2512.04489 / Published by ArXiv / Version released on 2025-12-24 / on (web) Publishing site


Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research / 2512.10058 / ISBN:https://doi.org/10.48550/arXiv.2512.10058 / Published by ArXiv / Version released on 2025-12-10 / on (web) Publishing site


Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research / 2412.04497 / ISBN:https://doi.org/10.48550/arXiv.2412.04497 / Published by ArXiv / Version released on 2026-01-05 / on (web) Publishing site


Ethics Practices in AI Development: An Empirical Study Across Roles and Regions / 2508.09219 / ISBN:https://doi.org/10.48550/arXiv.2508.09219 / Published by ArXiv / Version released on 2025-12-13 / on (web) Publishing site


Legal Alignment for Safe and Ethical AI / 2601.04175 / ISBN:https://doi.org/10.48550/arXiv.2601.04175 / Published by ArXiv / Version released on 2026-01-07 / on (web) Publishing site