Naval Postgraduate School

Dudley Knox Library

Ask a Librarian My Accounts

Generative AI

What are the Privacy Issues?

Gen AI requires massive amounts of data
- Generative AI tools require training data, and lots of it. This can be databases, books, videos, websites, etc., all of which could contain personally identifiable information (PII). The training data may be obtained from multiple sources that contain personal information, often without an individual's consent. Large Language Models (LLMs) learn from memorization--which can include sensitive data.
Easily mimics human language
- Large Language Models (LLMs) can oftentime be mistaken for a real human due to the very human-like language. This could be misleading to those who are interacting with it, which may lead them to sharing PII.
Targets for malicious prompt engineering
- It is possible that generative AI models are susceptible to malicious prompt engineering. This could lead to misleading and harmful content, as well as the dissemination of false information.

What are the Intellectual Property Issues?

Originally unoriginal work
- Generative AI models use existing data--a lot of it--to produce new data. Creating new work from existing data is skewing ownership rights; a decision that has yet to be made even in the U.S. Copyright Office.
- These works can be essays, artwork, music, poems, summaries, reports, articles, and more.
Up to interpretation
- A number of legal cases have already been filed by original creators against Gen AI companies. The lawsuits allege improper use of creative works, violating copyright, and trademarks. Answers on what the legal definition is of a "derivative work" varies by jurisdiction, and similarly, an interpretation of the Fair Use Doctrine, which permits copyrighted material to be used without the owner's persmission "for purposes such as criticism (including satire), comment, news reporting, teaching (including multiple copies for classroom use), scholarship, or research" Legal Information Institute.

Readings on Gen AI Privacy & Intellectual Property Concerns

Copyright and Artificial Intelligence
"In early 2023, the Copyright Office launched an initiative to examine the copyright law and policy issues raised by artificial intelligence (AI) technology, including the scope of copyright in works generated using AI tools and the use of copyrighted materials in AI training. After convening public listening sessions and hosting public webinars to gather and share information about current technologies and their impact, the Office published a notice of inquiry in the Federal Register in August 2023."
CRS Legal Sidebar--Generative Artificial Intelligence and Copyright Law
"These generative AI programs are trained to generate such outputs partly by exposing them to large quantities of existing works such as writings, photos, paintings, and other artworks. This Legal Sidebar explores questions that courts and the U.S. Copyright Office have begun to confront regarding whether generative AI outputs may be copyrighted and how generative AI might infringe copyrights in other works."
CRS Report--Generative Artificial Intelligence and Data Privacy: A Primer
"Critics contend that such models rely on privacy-invasive methods for mass data collection, typically without the consent or compensation of the original user, creator, or owner. Additionally, some models may be trained on sensitive data and reveal personal information to users. In a company blog post, Google AI researchers noted, “Because these datasets can be large (hundreds of gigabytes) and pull from a range of sources, they can sometimes contain sensitive data, including personally identifiable information (PII)—names, phone numbers, addresses, etc., even if trained on public data.” Academic and industry research has found that some existing LLMs may reveal sensitive data or personal information from their training datasets."
From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy
"This research paper highlights the limitations, challenges, potential risks, and opportunities of GenAI in the domain of cybersecurity and privacy. The work presents the vulnerabilities of ChatGPT, which can be exploited by malicious users to exfiltrate malicious information bypassing the ethical constraints on the model. This paper demonstrates successful example attacks like Jailbreaks, reverse psychology, and prompt injection attacks on the ChatGPT."
GenAIPABench: A Benchmark for Generative AI-based Privacy Assistants
"Privacy policies of websites are often lengthy and intricate. Privacy assistants assist in simplifying policies and making them more accessible and user friendly. The emergence of generative AI (genAI) offers new opportunities to build privacy assistants that can answer users questions about privacy policies. However, genAIs reliability is a concern due to its potential for producing inaccurate information. This study introduces GenAIPABench, a benchmark for evaluating Generative AI-based Privacy Assistants (GenAIPAs)."
Generative AI and US Intellectual Property Law
"The rapidity with which generative AI has been adopted and advanced has raised legal and ethical questions related to the impact on artists rights, content production, data collection, privacy, accuracy of information, and intellectual property rights. Recent administrative and case law challenges have shown that generative AI software systems do not have independent intellectual property rights in the content that they generate. It remains to be seen whether human content creators can retain their intellectual property rights against generative AI software, its developers, operators, and owners for the misappropriation of the work of human creatives, given the metes and bounds of existing law. Early signs from various courts are mixed as to whether and to what degree the results generated by AI models meet the legal standards of infringement under existing law."
Generative AI Has an Intellectual Property Problem
"Generative AI platforms are trained on data lakes and question snippets-- billions of parameters that are constructed by software processing huge archives of images and text [...] This process comes with legal risks, including intellectual property infringement. In many cases, it also poses legal questions that are still being resolved."
Generative Artificial Intelligence: 8 Critical Questions for Libraries
"In this article, we provide a brief overview of generative artificial intelligence (GenAI) and large language models (LLMs). We then propose eight critical questions that libraries should ask when exploring this technology and its implications for their communities. We argue that libraries have a unique role in facilitating informed and responsible use of GenAI, as well as safeguarding and promoting the values of access, privacy, and intellectual freedom."
The Legal Issues Presented by Generative AI
"Generative artificial intelligence, including large language models such as ChatGPT and image-generation software such as Stable Diffusion, are powerful new tools for individuals and businesses. They also raise profound and novel questions about how data is used in AI models and how the law applies to the output of those models, such as a paragraph of text or a computer-generated image."
Navigating Intellectual Property Rights in the Era of Generative AI: The Crucial Role of Educating Judicial Actors
"In the era of rapid technological advancements, the emergence of generative artificial intelligence has revolutionized numerous industries, including the creative sector. The EU is defining generative AI systems as “systems specifically intended to generate with varying levels of autonomy, content such as complex text, images, audio or video.”'
PPGAN: Privacy-Preserving Generative Adversarial Network
"Due to the gradient parameters of the deep neural network contain the data distribution of the training samples, they can easily remember the training samples. When GAN is applied to private or sensitive data, for instance, patient medical records, as private information may be leakage. To address this issue, we propose a Privacy-preserving Generative Adversarial Network (PPGAN) model, in which we achieve differential privacy in GANs by adding well-designed noise to the gradient during the model learning procedure."
Will Copyright Law Enable or Inhibit Generative AI?
"Emerging use cases around generative AI are disrupting traditional views of creativity, authorship and ownership and pushing the boundaries of copyright law. As the world catches up with innovation, the resulting legal ambiguity impacts all sides of the AI equation – developers, content creators and copyright owners."

Last Updated: Jun 12, 2025 9:18 AM
URL: https://libguides.nps.edu/gen-ai
Print Page

Subjects: Special Topics

Tags: AI, artificial intelligence, generative ai