_
RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome
for updates on publications, follow: on Instagram, Twitter, Patreon, YouTube


_

You are now here: AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13 > (tag cloud) >tag_selected: alice


Currently searching for:

if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long


if you modify the keywords, press enter within the field to confirm the new search key

Tag: alice

Bibliography items where occurs: 48
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation / 2305.11391 / ISBN:https://doi.org/10.48550/arXiv.2305.11391 / Published by ArXiv / Version released on 2023-08-27 / on (web) Publishing site


Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph / 2308.13534 / ISBN:https://doi.org/10.48550/arXiv.2308.13534 / Published by ArXiv / Version released on 2023-08-13 / on (web) Publishing site


Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs / 2311.09976 / ISBN:https://doi.org/10.48550/arXiv.2311.09976 / Published by ArXiv / Version released on 2023-11-16 / on (web) Publishing site


Contra generative AI detection in higher education assessments / 2312.05241 / ISBN:https://doi.org/10.48550/arXiv.2312.05241 / Published by ArXiv / Version released on 2023-12-30 / on (web) Publishing site


Review of Generative AI Methods in Cybersecurity / 2403.08701 / ISBN:https://doi.org/10.48550/arXiv.2403.08701 / Published by ArXiv / Version released on 2024-03-19 / on (web) Publishing site


Modeling Emotions and Ethics with Large Language Models / 2404.13071 / ISBN:https://doi.org/10.48550/arXiv.2404.13071 / Published by ArXiv / Version released on 2024-06-25 / on (web) Publishing site


From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap / 2404.13131 / ISBN:https://doi.org/10.1145/3630106.3658951 / Published by ArXiv / Version released on 2025-08-13 / on (web) Publishing site


Not My Voice! A Taxonomy of Ethical and Safety Harms of Speech Generators / 2402.01708 / ISBN:https://doi.org/10.48550/arXiv.2402.01708 / Published by ArXiv / Version released on 2024-05-15 / on (web) Publishing site


The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative / 2402.14859 / ISBN:https://doi.org/10.48550/arXiv.2402.14859 / Published by ArXiv / Version released on 2024-06-03 / on (web) Publishing site


Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective / 2406.05724 / ISBN:https://doi.org/10.48550/arXiv.2406.05724 / Published by ArXiv / Version released on 2024-06-11 / on (web) Publishing site


Fair by design: A sociotechnical approach to justifying the fairness of AI-enabled systems across the lifecycle / 2406.09029 / ISBN:https://doi.org/10.48550/arXiv.2406.09029 / Published by ArXiv / Version released on 2024-06-13 / on (web) Publishing site


Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias / 2407.11360 / ISBN:https://doi.org/10.48550/arXiv.2407.11360 / Published by ArXiv / Version released on 2024-07.16 / on (web) Publishing site


Visualization Atlases: Explaining and Exploring Complex Topics through Data, Visualization, and Narration / 2408.07483 / ISBN:https://doi.org/10.48550/arXiv.2408.07483 / Published by ArXiv / Version released on 2024-08-14 / on (web) Publishing site


Don't Kill the Baby: The Case for AI in Arbitration / 2408.11608 / ISBN:https://doi.org/10.48550/arXiv.2408.11608 / Published by ArXiv / Version released on 2025-03-22 / on (web) Publishing site


Improving governance outcomes through AI documentation: Bridging theory and practice / 2409.08960 / ISBN:https://doi.org/10.48550/arXiv.2409.08960 / Published by ArXiv / Version released on 2024-12-09 / on (web) Publishing site


The Gradient of Health Data Privacy / 2410.00897 / ISBN:https://doi.org/10.48550/arXiv.2410.00897 / Published by ArXiv / Version released on 2024-10-01 / on (web) Publishing site


Data Defenses Against Large Language Models / 2410.13138 / ISBN:https://doi.org/10.48550/arXiv.2410.13138 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site


Good intentions, unintended consequences: exploring forecasting harms / 2411.16531 / ISBN:https://doi.org/10.48550/arXiv.2411.16531 / Published by ArXiv / Version released on 2025-03-12 / on (web) Publishing site


Concerns and Values in Human-Robot Interactions: A Focus on Social Robotics / 2501.05628 / ISBN:https://doi.org/10.48550/arXiv.2501.05628 / Published by ArXiv / Version released on 2025-12-07 / on (web) Publishing site


Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications / 2501.12456 / ISBN:https://doi.org/10.48550/arXiv.2501.12456 / Published by ArXiv / Version released on 2025-01-21 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-12-27 / on (web) Publishing site


Agentic AI for Scaling Diagnosis and Care in Neurodegenerative Disease / 2502.06842 / ISBN:https://doi.org/10.48550/arXiv.2502.06842 / Published by ArXiv / Version released on 2025-12-23 / on (web) Publishing site


On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site


Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room / 2504.16148 / ISBN:https://doi.org/10.48550/arXiv.2504.16148 / Published by ArXiv / Version released on 2025-04-22 / on (web) Publishing site


AI Awareness / 2504.20084 / ISBN:https://doi.org/10.48550/arXiv.2504.20084 / Published by ArXiv / Version released on 2025-06-29 / on (web) Publishing site


Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery / 2505.16477 / ISBN:https://doi.org/10.48550/arXiv.2505.16477 / Published by ArXiv / Version released on 2025-05-22 / on (web) Publishing site


SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents / 2505.23559 / ISBN:https://doi.org/10.48550/arXiv.2505.23559 / Published by ArXiv / Version released on 2025-05-29 / on (web) Publishing site


Locating Risk: Task Designers and the Challenge of Risk Disclosure in RAI Content Work / 2505.24246 / ISBN:https://doi.org/10.48550/arXiv.2505.24246 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications / 2506.12594 / ISBN:https://doi.org/10.48550/arXiv.2506.12594 / Published by ArXiv / Version released on 2025-06-14 / on (web) Publishing site


Mechanistic Interpretability Needs Philosophy / 2506.18852 / ISBN:https://doi.org/10.48550/arXiv.2506.18852 / Published by ArXiv / Version released on 2025-06-23 / on (web) Publishing site


Exploring Collaboration Patterns and Strategies in Human-AI Co-creation through the Lens of Agency: A Scoping Review of the Top-tier HCI Literature / 2507.06000 / ISBN:https://doi.org/10.48550/arXiv.2507.06000 / Published by ArXiv / Version released on 2025-09-26 / on (web) Publishing site


A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems / 2508.07407 / ISBN:https://doi.org/10.48550/arXiv.2508.07407 / Published by ArXiv / Version released on 2025-08-31 / on (web) Publishing site


The Architecture of AI Transformation: Four Strategic Patterns and an Emerging Frontier / 2509.02853 / ISBN:https://doi.org/10.48550/arXiv.2509.02853 / Published by ArXiv / Version released on 2025-09-10 / on (web) Publishing site


Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants / 2508.12754 / ISBN:https://doi.org/10.48550/arXiv.2508.12754 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


AI Governance in Higher Education: A course design exploring regulatory, ethical and practical considerationsAI Governance in Higher Education: A course design exploring regulatory, ethical and practical considerations / 2509.06176 / ISBN:https://doi.org/10.48550/arXiv.2509.06176 / Published by ArXiv / Version released on 2025-09-16 / on (web) Publishing site


Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models / 2509.16332 / ISBN:https://doi.org/10.48550/arXiv.2509.16332 / Published by ArXiv / Version released on 2025-09-19 / on (web) Publishing site


Teaching AI to Feel: A Collaborative, Full-Body Exploration of Emotive Communication / 2509.22168 / ISBN:https://doi.org/10.48550/arXiv.2509.22168 / Published by ArXiv / Version released on 2025-09-26 / on (web) Publishing site


The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs / 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


Making Power Explicable in AI: Analyzing, Understanding, and Redirecting Power to Operationalize Ethics in AI Technical Practice / 2510.10588 / ISBN:https://doi.org/10.48550/arXiv.2510.10588 / Published by ArXiv / Version released on 2025-10-12 / on (web) Publishing site


Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications / 2511.02979 / ISBN:https://doi.org/10.48550/arXiv.2511.02979 / Published by ArXiv / Version released on 2026-01-23 / on (web) Publishing site


BeautyGuard: Designing a Multi-Agent Roundtable System for Proactive Beauty Tech Compliance through Stakeholder Collaboration / 2511.12645 / ISBN:https://doi.org/10.48550/arXiv.2511.12645 / Published by ArXiv / Version released on 2025-11-18 / on (web) Publishing site


Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs / 2511.21757 / ISBN:https://doi.org/10.48550/arXiv.2511.21757 / Published by ArXiv / Version released on 2025-11-24 / on (web) Publishing site


Beyond Abstract Compliance: Operationalising trust in AI as a moral relationship / 2601.22769 / ISBN:https://doi.org/10.48550/arXiv.2601.22769 / Published by ArXiv / Version released on 2026-01-30 / on (web) Publishing site


Disclose with Care: Designing Privacy Controls in Interview Chatbots / 2602.01387 / ISBN:https://doi.org/10.48550/arXiv.2602.01387 / Published by ArXiv / Version released on 2026-02-01 / on (web) Publishing site


Futuring Social Assemblages: How Enmeshing AIs into Social Life Challenges the Individual and the Interpersonal / 2602.03958 / ISBN:https://doi.org/10.48550/arXiv.2602.03958 / Published by ArXiv / Version released on 2026-02-03 / on (web) Publishing site


Artificial Intelligence in Open Source Software Engineering: A Foundation for Sustainability / 2602.07071 / ISBN:https://doi.org/10.48550/arXiv.2602.07071 / Published by ArXiv / Version released on 2026-02-05 / on (web) Publishing site


Reliable and Responsible Foundation Models: A Comprehensive Survey / 2602.08145 / ISBN:https://doi.org/10.48550/arXiv.2602.08145 / Published by ArXiv / Version released on 2026-02-04 / on (web) Publishing site