_
RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome
for updates on publications, follow: on Instagram, Twitter, Patreon, YouTube


_

You are now here: AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13
An experimental downloadable Q&A AI model is available on Huggingface, CC-BY-SA-4.0, updated up to 2026-05-11 (update frequency: quarterly)

> (tag cloud) >tag_selected: ziegler


Currently searching for:

if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long


if you modify the keywords, press enter within the field to confirm the new search key

Tag: ziegler

Bibliography items where occurs: 32
The Cambridge Law Corpus: A Corpus for Legal AI Research / 2309.12269 / ISBN:https://doi.org/10.48550/arXiv.2309.12269 / Published by ArXiv / Version released on 2024-01-01 / on (web) Publishing site


A Survey on Human-AI Collaboration with Large Foundation Models / 2403.04931 / ISBN:https://doi.org/10.48550/arXiv.2403.04931 / Published by ArXiv / Version released on 2025-09-02 / on (web) Publishing site


AI Alignment: A Comprehensive Survey / 2310.19852 / ISBN:https://doi.org/10.48550/arXiv.2310.19852 / Published by ArXiv / Version released on 2025-04-04 / on (web) Publishing site


Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback / 2404.10271 / ISBN:https://doi.org/10.48550/arXiv.2404.10271 / Published by ArXiv / Version released on 2024-06-04 / on (web) Publishing site


On the Creativity of Large Language Models / 2304.00008 / ISBN:https://doi.org/10.48550/arXiv.2304.00008 / Published by ArXiv / Version released on 2024-09-18 / on (web) Publishing site


Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site


Large Language Models in Politics and Democracy: A Comprehensive Survey / 2412.04498 / ISBN:https://doi.org/10.48550/arXiv.2412.04498 / Published by ArXiv / Version released on 2024-12-16 / on (web) Publishing site


Hybrid Approaches for Moral Value Alignment in AI Agents: a Manifesto / 2312.01818 / ISBN:https://doi.org/10.48550/arXiv.2312.01818 / Published by ArXiv / Version released on 2025-01-16 / on (web) Publishing site


Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2026-04-14 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site


Multi-Agent Risks from Advanced AI / 2502.14143 / ISBN:https://doi.org/10.48550/arXiv.2502.14143 / Published by ArXiv / Version released on 2025-02-19 / on (web) Publishing site


Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs / 2505.02009 / ISBN:https://doi.org/10.48550/arXiv.2505.02009 / Published by ArXiv / Version released on 2025-08-12 / on (web) Publishing site


Are Language Models Consequentialist or Deontological Moral Reasoners? / 2505.21479 / ISBN:https://doi.org/10.48550/arXiv.2505.21479 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation / 2507.15901 / ISBN:https://doi.org/10.48550/arXiv.2507.15901 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


The Fair Game: Auditing & Debiasing AI Algorithms Over Time / 2508.06443 / ISBN:https://doi.org/10.48550/arXiv.2508.06443 / Published by ArXiv / Version released on 2025-08-08 / on (web) Publishing site


A Comprehensive Review of Datasets for Clinical Mental Health AI Systems / 2508.09809 / ISBN:https://doi.org/10.48550/arXiv.2508.09809 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


CAI Fluency: A Framework for Cybersecurity AI Fluency / 2508.13588 / ISBN:https://doi.org/10.48550/arXiv.2508.13588 / Published by ArXiv / Version released on 2025-10-07 / on (web) Publishing site


Towards Enhancing Data Equity in Public Health Data Science / 2508.20301 / ISBN:https://doi.org/10.48550/arXiv.2508.20301 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


Evaluating the Clinical Safety of LLMs in Response to High-Risk Mental Health Disclosures / 2509.08839 / ISBN:https://doi.org/10.48550/arXiv.2509.08839 / Published by ArXiv / Version released on 2025-09-01 / on (web) Publishing site


TVS Sidekick: Challenges and Practical Insights from Deploying Large Language Models in the Enterprise / 2509.26482 / ISBN:https://doi.org/10.48550/arXiv.2509.26482 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption / 2510.22933 / ISBN:https://doi.org/10.48550/arXiv.2510.22933 / Published by ArXiv / Version released on 2026-04-25 / on (web) Publishing site


Knowing Ourselves Through Others: Reflecting with AI in Digital Human Debates / 2511.13046 / ISBN:https://doi.org/10.48550/arXiv.2511.13046 / Published by ArXiv / Version released on 2025-11-17 / on (web) Publishing site


On the Role and Impact of GenAI Tools in Software Engineering Education / 2512.04256 / ISBN:https://doi.org/10.48550/arXiv.2512.04256 / Published by ArXiv / Version released on 2025-12-03 / on (web) Publishing site


Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models / 2502.07077 / ISBN:https://doi.org/10.48550/arXiv.2502.07077 / Published by ArXiv / Version released on 2026-02-02 / on (web) Publishing site


Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM) / 2601.14298 / ISBN:https://doi.org/10.48550/arXiv.2601.14298 / Published by ArXiv / Version released on 2026-01-16 / on (web) Publishing site


Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site


Trustworthy AI Software Engineers / 2602.06310 / ISBN:https://doi.org/10.48550/arXiv.2602.06310 / Published by ArXiv / Version released on 2026-02-06 / on (web) Publishing site


Must Read: A Comprehensive Survey of Computational Persuasion / 2505.07775 / ISBN:https://doi.org/10.48550/arXiv.2505.07775 / Version released on 2026-03-23 / on (web) Publishing site


Bridging the Gap in the Responsible AI Divides / 2603.14495 / ISBN:https://doi.org/10.48550/arXiv.2603.14495 / Version released on 2026-03-15 / on (web) Publishing site


Reflections and New Directions for Human-Centered Large Language Models / 2605.06901 / ISBN:https://doi.org/10.48550/arXiv.2605.06901 / Version released on 2026-05-07 / on (web) Publishing site


Co-Constructing Alignment: A Participatory Approach to Situate AI Values / 2601.15895 / ISBN:https://doi.org/10.48550/arXiv.2601.15895 / Version released on 2026-04-21 / on (web) Publishing site