_
RobertoLofaro.com - Knowledge Portal - human-generated content
Change, with and without technology - human, AI, scraping readers welcome
for updates on publications, follow: on Instagram, Twitter, Patreon, YouTube, Kaggle metadata


_

You are now here: AI Ethics Primer - search within the bibliography - version 0.4 of 2023-12-13 > (tag cloud) >tag_selected: defending


Currently searching for:

if you need more than one keyword, modify and separate by underscore _
the list of search keywords can be up to 50 characters long


if you modify the keywords, press enter within the field to confirm the new search key

Tag: defending

Bibliography items where occurs: 49
Moral Responsibility for AI Systems / 2310.18040 / ISBN:https://doi.org/10.48550/arXiv.2310.18040 / Published by ArXiv / Version released on 2023-10-27 / on (web) Publishing site


Artificial Intelligence Ethics Education in Cybersecurity: Challenges and Opportunities: a focus group report / 2311.00903 / ISBN:https://doi.org/10.48550/arXiv.2311.00903 / Published by ArXiv / Version released on 2023-11-02 / on (web) Publishing site


Practical Cybersecurity Ethics: Mapping CyBOK to Ethical Concerns / 2311.10165 / ISBN:https://doi.org/10.48550/arXiv.2311.10165 / Published by ArXiv / Version released on 2023-11-16 / on (web) Publishing site


Survey on AI Ethics: A Socio-technical Perspective / 2311.17228 / ISBN:https://doi.org/10.48550/arXiv.2311.17228 / Published by ArXiv / Version released on 2025-11-04 / on (web) Publishing site


Responsible Artificial Intelligence: A Structured Literature Review / 2403.06910 / ISBN:https://doi.org/10.48550/arXiv.2403.06910 / Published by ArXiv / Version released on 2024-03-11 / on (web) Publishing site


Epistemic Power in AI Ethics Labor: Legitimizing Located Complaints / 2402.08171 / ISBN:https://doi.org/10.1145/3630106.3658973 / Published by ArXiv / Version released on 2024-04-17 / on (web) Publishing site


AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research / 2405.01859 / ISBN:https://doi.org/10.48550/arXiv.2405.01859 / Published by ArXiv / Version released on 2024-05-31 / on (web) Publishing site


Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models / 2405.07076 / ISBN:https://doi.org/10.48550/arXiv.2405.07076 / Published by ArXiv / Version released on 2024-05-14 / on (web) Publishing site


A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions / 2405.14487 / ISBN:https://doi.org/10.48550/arXiv.2405.14487 / Published by ArXiv / Version released on 2024-05-23 / on (web) Publishing site


A Survey on Privacy Attacks Against Digital Twin Systems in AI-Robotics / 2406.18812 / ISBN:https://doi.org/10.48550/arXiv.2406.18812 / Published by ArXiv / Version released on 2024-06-27 / on (web) Publishing site


AI-Driven Chatbot for Intrusion Detection in Edge Networks: Enhancing Cybersecurity with Ethical User Consent / 2408.04281 / ISBN:https://doi.org/10.48550/arXiv.2408.04281 / Published by ArXiv / Version released on 2024-08-08 / on (web) Publishing site


Is Generative AI the Next Tactical Cyber Weapon For Threat Actors? Unforeseen Implications of AI Generated Cyber Attacks / 2408.12806 / ISBN:https://doi.org/10.48550/arXiv.2408.12806 / Published by ArXiv / Version released on 2024-08-23 / on (web) Publishing site


Enhancing transparency in AI-powered customer engagement / 2410.01809 / ISBN:https://doi.org/10.48550/arXiv.2410.01809 / Published by ArXiv / Version released on 2024-09-13 / on (web) Publishing site


Data Defenses Against Large Language Models / 2410.13138 / ISBN:https://doi.org/10.48550/arXiv.2410.13138 / Published by ArXiv / Version released on 2024-10-17 / on (web) Publishing site


Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems / 2410.13334 / ISBN:https://doi.org/10.48550/arXiv.2410.13334 / Published by ArXiv / Version released on 2024-10-23 / on (web) Publishing site


Jailbreaking and Mitigation of Vulnerabilities in Large Language Models / 2410.15236 / ISBN:https://doi.org/10.48550/arXiv.2410.15236 / Published by ArXiv / Version released on 2025-11-25 / on (web) Publishing site


Smoke Screens and Scapegoats: The Reality of General Data Protection Regulation Compliance -- Privacy and Ethics in the Case of Replika AI / 2411.04490 / ISBN:https://doi.org/10.48550/arXiv.2411.04490 / Published by ArXiv / Version released on 2024-11-07 / on (web) Publishing site


From Principles to Practice: A Deep Dive into AI Ethics and Regulations / 2412.04683 / ISBN:https://doi.org/10.48550/arXiv.2412.04683 / Published by ArXiv / Version released on 2025-02-06 / on (web) Publishing site


Technology as uncharted territory: Contextual integrity and the notion of AI as new ethical ground / 2412.05130 / ISBN:https://doi.org/10.48550/arXiv.2412.05130 / Published by ArXiv / Version released on 2025-08-27 / on (web) Publishing site


Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment / 2412.15114 / ISBN:https://doi.org/10.48550/arXiv.2412.15114 / Published by ArXiv / Version released on 2024-12-19 / on (web) Publishing site


Large Language Model Safety: A Holistic Survey / 2412.17686 / ISBN:https://doi.org/10.48550/arXiv.2412.17686 / Published by ArXiv / Version released on 2024-12-23 / on (web) Publishing site


Towards Safe AI Clinicians: A Comprehensive Study on Large Language Model Jailbreaking in Healthcare / 2501.18632 / ISBN:https://doi.org/10.48550/arXiv.2501.18632 / Published by ArXiv / Version released on 2025-01-27 / on (web) Publishing site


Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety / 2502.05206 / ISBN:https://doi.org/10.48550/arXiv.2502.05206 / Published by ArXiv / Version released on 2025-08-02 / on (web) Publishing site


Prioritization First, Principles Second: An Adaptive Interpretation of Helpful, Honest, and Harmless Principles / 2502.06059 / ISBN:https://doi.org/10.48550/arXiv.2502.06059 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective / 2502.14296 / ISBN:https://doi.org/10.48550/arXiv.2502.14296 / Published by ArXiv / Version released on 2025-09-30 / on (web) Publishing site


DarkBench: Benchmarking Dark Patterns in Large Language Models / 2503.10728 / ISBN:https://doi.org/10.48550/arXiv.2503.10728 / Published by ArXiv / Version released on 2025-03-13 / on (web) Publishing site


Designing AI-Enabled Countermeasures to Cognitive Warfare / 2504.11486 / ISBN:https://doi.org/10.48550/arXiv.2504.11486 / Published by ArXiv / Version released on 2025-04-14 / on (web) Publishing site


Regulating Next-Generation Implantable Brain-Computer Interfaces: Recommendations for Ethical Development and Implementation / 2506.12540 / ISBN:https://doi.org/10.48550/arXiv.2506.12540 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


Foundation of Affective Computing and Interaction / 2506.15497 / ISBN:https://doi.org/10.48550/arXiv.2506.15497 / Published by ArXiv / Version released on 2025-06-18 / on (web) Publishing site


A Practical SAFE-AI Framework for Small and Medium-Sized Enterprises Developing Medical Artificial Intelligence Ethics Policies / 2507.01304 / ISBN:https://doi.org/10.48550/arXiv.2507.01304 / Published by ArXiv / Version released on 2025-07-02 / on (web) Publishing site


When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance / 2507.07748 / ISBN:https://doi.org/10.48550/arXiv.2507.07748 / Published by ArXiv / Version released on 2025-07-10 / on (web) Publishing site


The AI Ethical Resonance Hypothesis: The Possibility of Discovering Moral Meta-Patterns in AI Systems / 2507.11552 / ISBN:https://doi.org/10.48550/arXiv.2507.11552 / Published by ArXiv / Version released on 2025-07-13 / on (web) Publishing site


Exploiting Jailbreaking Vulnerabilities in Generative AI to Bypass Ethical Safeguards for Facilitating Phishing Attacks / 2507.12185 / ISBN:https://doi.org/10.48550/arXiv.2507.12185 / Published by ArXiv / Version released on 2025-07-16 / on (web) Publishing site


Challenges of Trustworthy Federated Learning: What's Done, Current Trends and Remaining Work / 2507.15796 / ISBN:https://doi.org/10.48550/arXiv.2507.15796 / Published by ArXiv / Version released on 2025-07-21 / on (web) Publishing site


ADEPTS: A Capability Framework for Human-Centered Agent Design / 2507.15885 / ISBN:https://doi.org/10.48550/arXiv.2507.15885 / Published by ArXiv / Version released on 2025-07-18 / on (web) Publishing site


Countering Privacy Nihilism / 2507.18253 / ISBN:https://doi.org/10.48550/arXiv.2507.18253 / Published by ArXiv / Version released on 2025-07-24 / on (web) Publishing site


Towards a Manifesto for Cyber Humanities: Paradigms, Ethics, and Prospects / 2508.02760 / ISBN:https://doi.org/10.48550/arXiv.2508.02760 / Published by ArXiv / Version released on 2025-08-03 / on (web) Publishing site


Think First, Verify Always: Training Humans to Face AI Risks / 2508.03714 / ISBN:https://doi.org/10.48550/arXiv.2508.03714 / Published by ArXiv / Version released on 2025-07-23 / on (web) Publishing site


Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance / 2508.08789 / ISBN:https://doi.org/10.48550/arXiv.2508.08789 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants / 2508.12754 / ISBN:https://doi.org/10.48550/arXiv.2508.12754 / Published by ArXiv / Version released on 2025-08-18 / on (web) Publishing site


Ethics of Artificial Intelligence / 2508.16658 / ISBN:https://doi.org/10.48550/arXiv.2508.16658 / Published by ArXiv / Version released on 2025-08-20 / on (web) Publishing site


Between a Rock and a Hard Place: Exploiting Ethical Reasoning to Jailbreak LLMs / 2509.05367 / ISBN:https://doi.org/10.48550/arXiv.2509.05367 / Published by ArXiv / Version released on 2025-09-12 / on (web) Publishing site


The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? / 2506.01813 / ISBN:https://doi.org/10.48550/arXiv.2506.01813 / Published by ArXiv / Version released on 2025-09-29 / on (web) Publishing site


AI and the Future of Academic Peer Review / 2509.14189 / ISBN:https://doi.org/10.48550/arXiv.2509.14189 / Published by ArXiv / Version released on 2025-09-18 / on (web) Publishing site


Digital Sovereignty Control Framework for Military AI-based Cyber Security / 2509.13072 / ISBN:https://doi.org/10.48550/arXiv.2509.13072 / Published by ArXiv / Version released on 2025-09-16 / on (web) Publishing site


Using Generative Artificial Intelligence Creatively in the Classroom and Research: Examples and Lessons Learned / 2409.05176 / ISBN:https://doi.org/10.48550/arXiv.2409.05176 / Published by ArXiv / Version released on 2025-10-24 / on (web) Publishing site


The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs / 2506.11094 / ISBN:https://doi.org/10.48550/arXiv.2506.11094 / Published by ArXiv / Version released on 2025-10-30 / on (web) Publishing site


AI Alignment vs. AI Ethical Treatment: 10 Challenges / 2510.12844 / ISBN:https://doi.org/10.48550/arXiv.2510.12844 / Published by ArXiv / Version released on 2025-10-14 / on (web) Publishing site


Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning / 2511.07682 / ISBN:https://doi.org/10.48550/arXiv.2511.07682 / Published by ArXiv / Version released on 2025-11-10 / on (web) Publishing site