Responsible AI

Data Security in GenAI: The Most Common Organizational Mistakes

In many companies, data security in GenAI is treated as a policy task: write the rules, communicate them, run training, and "close the risk." In practice, the risk remains because the biggest mistakes come not from mi…

Paweł Kubisiak·2026-06-01·8 min read

# Data Security in GenAI: The Most Common Organizational Mistakes

First comes time pressure: "we need to launch this use case fast." Then come compromises: temporary exceptions, manual workarounds, incomplete data classification, and unclear rules for using external tools. After a few months, the organization discovers that it has a formally strict policy and an operationally porous system.

The central thesis: data security in GenAI is a problem of work and accountability design, not just technical configuration. Organizations that do not embed controls in everyday data flows will remain in a constant catch-up cycle of incidents and audits.

Mistake 1: data classification only "on paper"

Companies often have information classification (public, internal, confidential), but do not translate it into concrete GenAI usage rules. Employees do not know what they can paste into an assistant, which data requires anonymization, or what the consequences are for violations.

Without mapping classification to concrete tasks, control becomes declarative. And declarative control fails under time pressure.

In practice, you need rules such as:

- which data classes are strictly excluded from external tools, - when masking or pseudonymization is required, - which logs and evidence must be produced for use of sensitive data.

ISO/IEC 27001:2022 and ISO/IEC 27701:2019 provide security and privacy management frameworks, but the organization must translate them into role-specific task instructions.

Mistake 2: assuming "the vendor handles security"

A vendor can provide solid infrastructure, but it does not take responsibility for what data employees send, how the model is used, and what business decisions are made from its output.

The most common vendor-risk gaps:

- missing clear clauses on data retention and downstream processing, - ambiguous terms on training with customer data, - limited transparency of logging and audit mechanisms, - no exit and data-recovery plan when changing providers.

That is why procurement and security must work together. The contract cannot be an "after-launch add-on."

Mistake 3: ignoring LLM-specific risks

OWASP Top 10 for LLM Applications (2025) shows that GenAI introduces a new class of threats: prompt injection, output-based leakage, uncontrolled tool use, excessive agent autonomy, and context capture.

Organizations that treat LLMs like ordinary SaaS apps often miss these vectors. The result: classic controls are in place, but the gap appears in the interaction layer between model and contextual data.

NIST AI RMF 1.0 (2023) emphasizes that AI risk management requires system-level assessment - from data and model to process usage. That means scenario testing, not just a configuration checklist.

Mistake 4: treating shadow AI as a "people problem," not a system problem

When the official path is too slow, employees look for shortcuts. Shadow AI rarely comes from rebellion. More often it comes from tool/process mismatch: poor availability, insufficient functionality, or approval workflows that are too complex.

Punitive messaging usually does not solve this. You need, in parallel:

- a usable official path, - clear boundary rules, - a fast channel for reporting needs and exceptions, - real manager support for decision-making.

ENISA Threat Landscape (2024) indicates that human factors and configuration errors remain key exposure sources. In GenAI, this pattern only accelerates.

Mistake 5: no AI-data incident response plan

Many organizations have a general security incident response plan, but no variant for GenAI-related incidents. This is a practical difference: you must quickly determine what data reached the model, where it may have been stored, who had access, and whether output was used downstream.

Without a prepared playbook, the first incident hours are chaotic. That increases damage and complicates communication with clients and regulators.

NIST CSF 2.0 (2024) highlights preparedness and recovery. In GenAI, this also means procedures to withdraw a specific workflow and immediately switch to manual mode.

Mistake 6: no contextual education

A one-off "AI security training" session has low effectiveness. Employees need micro-instructions embedded in role context:

- sales rep: what customer data can be entered and in what form, - analyst: how to anonymize cases and document sources, - HR: which candidate information is absolutely prohibited, - team lead: when to stop using the tool and escalate risk.

Contextual education is cheaper than incident remediation.

Mistake 7: no practical data minimization

Many teams declare a minimization principle, but in practice send more information than needed to the tool. The reason is usually operational: it is easier to paste full context than to prepare a safe summary.

This is where privacy by design must meet work ergonomics. If preparing secure input takes too long, users will choose risky shortcuts.

The solution: ready-to-use anonymization templates and "minimum necessary data" checklists for common scenarios.

Mistake 8: monitoring without quality signals

Some organizations monitor only activity: who uses the tool and how often. That is not enough for security. You also need quality signals:

- number of cases with incorrect confidentiality classification, - number of attempts to use prohibited data, - number of escalations related to suspicious output, - response time to potential-leak reports.

Without these indicators, the security team sees traffic but not risk.

The minimal 5x5 control model

A practical starter model includes 5 domains and 5 control questions.

Domains:

1. data and classification, 2. vendors and contracts, 3. workflows and permissions, 4. monitoring and audit, 5. incidents and recovery.

Questions:

- what can go wrong, - who is accountable, - how we detect it, - how we stop it, - how we prove control compliance.

If the team cannot answer these questions for a critical use case, it is not ready for full scale.

The 5x5 model should be extended with a simple "owner + evidence" rule: every answer must have an owner and operational evidence. A declaration without evidence is a control-gap signal.

Example: the question "how will we detect this?" does not end with "we have monitoring," but with a specific alert, threshold, and on-call owner.

"Bad -> good" deployment decision

Bad decision: "We will launch the assistant for all departments and refine the rules after the first month of usage."

Effect: fast adoption, but uncontrolled data flows, inconsistent practices across teams, and difficult event reconstruction after the first incident.

Good decision: "We will launch in stages - first low data-risk roles, with mandatory input classification, usage logging, and a ready incident playbook; we expand only after quality and risk review."

Effect: slower start, but lower systemic risk and better scaling capability.

What to do in 60 days

First, select 3 critical GenAI workflows and run a fast 5x5 review. Next, update contracts and retention procedures for active vendors. Then launch contextual training for roles with the highest data-exposure risk.

In parallel, prepare a dedicated AI-data incident playbook and run a tabletop exercise. It is better to expose gaps in a workshop than in a real incident.

In the final two weeks of the 60-day cycle, run a "red team lite" test for the two most important workflows. The goal is not a full offensive audit, but verifying whether the organization can detect and stop the most likely scenarios: prompt injection, output-based data exposure, and unauthorized data transfer.

How to combine security with deployment speed

The biggest tension in GenAI deployments is the conflict between speed and control. Organizations often choose one extreme: either fast rollout with high risk, or full control with near paralysis.

A practical compromise is two tracks:

- a fast track for low data-risk, low decision-impact use cases, - a controlled track for workflows with sensitive data, regulatory implications, or high customer impact.

Each track needs clearly defined gates. Then users get a predictable process, and security does not become a "last-minute brake."

The role of the board and risk committee

Data security in GenAI cannot be delegated solely to the CISO. The board should regularly review at least four signals:

- incident and near-miss trend, - time to close control gaps, - alignment between team practice and data policy, - contingency-readiness for critical workflows.

This rhythm is not for micromanagement. It ensures data risk is not discovered only during a reputation crisis.

Executive Takeaway

What changed? The biggest data security failures in GenAI come from a gap between policy and day-to-day workflow.

Why does it matter? Effective protection requires combining data classification, vendor-risk control, LLM risk testing, and incident readiness.

What should leaders do? Organizations should deploy GenAI in stages, with clear accountability and controls embedded in operational work.

Paweł Kubisiak

Partner at AI&Scale, Editor in Chief

Partner at AI&Scale and Editor in Chief, responsible for editorial quality and direction across AI transformation, governance and scaling coverage.

AI Governance & Law