Updated June 15, 2026
TL;DR: Standard SaaS due diligence often misses AI-specific risk categories that auditors increasingly ask about in AI vendor reviews: model opacity, training data lineage, algorithmic bias liability, prompt injection defenses, sub-processor chains, and model drift. This article gives you a tiered assessment process, a core checklist covering data handling, certifications, and contractual protections, and an evidence retention schedule aligned to NIST AI RMF, the EU AI Act, and SOC 2. You can hand the output to an auditor as a documented vendor risk program, not a spreadsheet of vendor names.
If you still apply a modified version of your existing IT vendor questionnaire to AI tools, the gap between that approach and what an audit actually requires is significant, and the EU AI Act compliance obligations now make that gap a legal exposure.
EU AI Act penalties are tiered: prohibited practices under Article 5 carry fines up to €35 million or 7% of global annual turnover, with high-risk AI system breaches under Article 99 carrying fines up to €15 million or 3% of turnover. That cost structure is concrete enough to bring to any board conversation.
This checklist covers what auditors expect to see, organized so you can run it before a contract is signed and maintain it through the full vendor lifecycle.
Traditional vendor reviews assess access controls, encryption, and uptime. They often don't adequately address the following risk categories that auditors conducting AI governance reviews increasingly ask about:
The NIST AI RMF GOVERN 1.1 function requires that legal and regulatory requirements involving AI are understood, managed, and documented. That obligation doesn't stop at your own models. It extends to every vendor providing AI capabilities to your organization. For context on how NIST's framework translates into operating practice for regulated organizations, the Practical AI episode with NIST's Chief AI Advisor Elham Tabassi walks through the design choices behind the AI RMF.
When auditors conduct AI governance reviews, they commonly ask for evidence across multiple documentation categories, and many organizations struggle to produce all of them on short notice. The following eight categories reflect what a well-documented vendor risk program should be able to surface:
The AICPA SOC 2 Trust Services Criteria define five criteria for evaluating security controls, with the Security criteria required in every report. CC9.2 specifically requires identification and evaluation of vulnerabilities arising from vendor and business partner relationships, which means your vendor risk documentation is itself subject to auditor scrutiny.
The steps below run in sequence: tiering the vendor first determines the scope of every subsequent activity, so skipping or reversing that order typically produces an assessment that is either over-engineered for low-risk tools or under-documented for critical ones.
Before you run a single questionnaire, classify the vendor. The depth of assessment should be proportional to risk: critical vendors handling sensitive data warrant full due diligence across all checklist categories, while low-risk vendors providing commodity services need only basic business verification and sanctions screening.
|
Dimension |
Tier 1: Critical |
Tier 2: High |
Tier 3: Medium |
Tier 4: Low |
|---|---|---|---|---|
|
Data sensitivity |
PHI, PII, financial, proprietary algorithms |
Internal confidential, non-sensitive customer data |
Lower-sensitivity data |
Minimal sensitivity data |
|
Assessment depth |
Full due diligence across all categories |
Security and legal review with detailed questionnaire |
Focused questionnaire plus certification check |
Standard due diligence appropriate to risk level, including business verification and sanctions screening |
|
Certification required |
Current SOC 2 Type II, ISO 27001 encouraged |
Current SOC 2 Type II or ISO 27001 recommended; certification requirement should reflect organizational risk tolerance and applicable regulatory context |
SOC 2 or equivalent third-party attestation recommended; requirement should reflect organizational risk tolerance and applicable regulatory context |
Vendor self-attestation acceptable; third-party attestation recommended where available |
|
Review frequency |
Full re-assessment annually, quarterly monitoring recommended |
Re-assessment every two years, semi-annual monitoring recommended |
Re-assessment every 18 months to two years recommended; annual monitoring check recommended |
Re-assessment every two to three years recommended, or when significant changes occur |
Assign a tier based on data sensitivity, business criticality if the service fails, integration depth, and whether the workload falls under GDPR, HIPAA, PCI DSS, or the EU AI Act. Scaling agentic AI workloads into production increases the tier classification of any vendor involved.
The questions below are organized by risk domain so you can scope each section to the vendor's tier without running the full checklist for every engagement. Require written responses with supporting documentation rather than verbal confirmation; the written record is what auditors will ask to see.
Frame these as information requests and require written responses with supporting documentation for each.
Data collection and purpose:
Data residency and jurisdiction:
Sub-processor transparency:
Data retention and deletion:
Certifications to require and how to verify them:
Technical controls to validate:
Verify these controls are in place and documented in every Tier 1 and Tier 2 vendor assessment:
The OWASP Top 10 for Agentic Applications recommends running extensions in the user's security context rather than with generic high-privileged identities, and ensuring authorization happens in external systems rather than being delegated to the AI model. Vendor architecture should reflect this.
Audit rights belong in the contract before signing. These provisions are non-negotiable for Tier 1 and Tier 2 vendors:
Vendor risk doesn't end at contract signature. It ends at offboarding, and only if offboarding produced a signed data deletion certificate.
A scheduled periodic review is not enough: these events require full re-assessment regardless of where you are in the review cycle, because fragmented AI tools create monitoring gaps that allow ungoverned vendor interactions to accumulate into systemic risk rather than isolated incidents.
These signals warrant an immediate walk-away decision, regardless of business pressure to proceed:
These aren't negotiating positions. They're structural signals that the vendor cannot meet the risk and compliance baseline your organization needs, and no contract language will close the gap. For a concrete illustration, the Microsoft Copilot security analysis covers the specific data leakage vectors that apply across AI tool categories when vendor controls are insufficient.
The GDPR Article 30 requirement to maintain records of processing activities now overlaps with EU AI Act Articles 12 and 26(6) obligations for AI-specific traceability. Your retention schedule should reflect both, aligned to the industry frameworks that govern your organization.
|
Document type |
Retention guidance |
Regulatory driver |
|---|---|---|
|
Vendor assessment questionnaires and risk ratings (dated) |
6 years minimum (HIPAA compliance documentation); 7 years (SOX audit documents); retention duration under GDPR based on purpose and necessity |
Compliance audit trail |
|
SOC 2 and ISO 27001 reports (full, not summaries) |
Retain current report plus prior reports per legal requirements and organizational risk tolerance; no universal minimum prescribed |
Demonstrates vendor control validation |
|
Data Processing Agreements |
6 years minimum (HIPAA compliance documentation); retain per contract terms and regulatory obligations; GDPR requires clear data handling processes post-termination |
GDPR Article 28 |
|
Security assessments and penetration testing results |
Retain per applicable regulatory framework; PCI DSS 4.0 requires a minimum of 12 months for penetration testing results and remediation activities; longer periods apply under SOX and other frameworks based on organizational risk tolerance |
Demonstrates due diligence |
|
Vendor incident reports and breach notifications |
Retain per industry standards and breach notification laws, with litigation hold if active investigation |
Regulatory investigation, breach law |
|
Change management logs for model updates |
Retain per applicable regulatory framework and organizational risk tolerance; no universal minimum prescribed for AI model change logs specifically |
Continuous monitoring evidence |
|
Audit rights exercise records and offboarding certificates |
Retain per applicable regulatory framework and contract terms; general healthcare and finance guidance suggests aligning with contract retention periods, but no specific regulatory requirement prescribes a minimum for these document types |
Governance trail |
Store documentation in a GRC platform with role-based access, AES-256 encryption at rest, access logging, and automated retention expiration with litigation hold capability. Spreadsheet-based tracking will not survive auditor scrutiny once you need to demonstrate dated workflows, approval chains, and continuous monitoring evidence.
The checklist above applies to every external AI vendor you evaluate. That raises a structural question: what happens when the governance infrastructure itself introduces third-party vendor risk?
Most AI governance point solutions, including external gateways and third-party filters, add a new third-party risk item to your environment. The governance logs they generate live outside your infrastructure, which means the audit log is outside your control. This is a structural problem, not a configuration gap. For context on how fragmented AI tooling creates this exposure, the Prediction Guard analysis of AI supply chain transparency is worth reviewing alongside your vendor assessment process.
Prediction Guard deploys a sovereign AI control plane inside your own infrastructure, whether on-premises, in a cloud VPC, or air-gapped, so governance logic and AI governance policies aligned to NIST AI RMF and the OWASP Agentic AI Top Ten are enforced inside your perimeter, and audit logs are generated inside your environment and consumed by your SIEM. When you register AI assets (models, MCP servers, datasets) into the control plane, the system produces an exportable AIBOM in CycloneDX format, the same format the OWASP AIBOM project recommends for AI supply chain transparency.
For enterprise context on how MCP and Kubernetes are reshaping production AI deployment, the Practical AI episode walks through the architecture decisions that precede vendor selection.
If your risk and compliance program is ready to move from manual vendor tracking to system-level AI governance, book a deployment scoping call to assess whether self-hosted deployment fits your infrastructure and audit requirements.
AI vendors introduce risk categories that standard SaaS checklists often don't adequately address: model opacity, training data lineage, algorithmic bias liability, real-time inference security including prompt injection, sub-processor model provider chains, and model drift. Each category has its own risk and compliance implications and requires specific contractual and technical controls beyond what a generic IT vendor questionnaire addresses.
A critical AI vendor handling sensitive data is strongly recommended to hold a current SOC 2 Type II report (covering the Security criteria at minimum) and ISO/IEC 27001 with a verified accrediting body; while neither is universally mandated across all jurisdictions or frameworks, both represent the de facto baseline for enterprise AI vendor due diligence. ISO/IEC 42001 is an emerging maturity signal for AI-specific governance. Verify all certifications at the issuing body's registry, not from vendor marketing pages, and confirm the report covers your specific use case scope.
Tier 1 vendors require annual full re-assessment with quarterly monitoring. Tier 2 vendors require re-assessment every two years with semi-annual monitoring. Outside the normal cycle, mandatory re-assessment triggers include security incidents, certification lapses, ownership changes, and new high-risk use cases, regardless of where you are in the review schedule.
At minimum, contracts must include the right to audit security, data protection, and AI governance practices with reasonable advance notice, up to twice per year for critical vendors, expedited audit rights for breach investigations, sub-processor audit rights with advance notice of changes, model explainability and bias testing access, remediation timelines of 15 days for critical findings, 30 days for high findings, and 60 days for medium findings, and full regulatory examination cooperation without requiring your advance permission.
Retention requirements vary by industry framework: 6 years for healthcare (HIPAA), 7 years for finance (SOX), a minimum of 3 years post-termination under GDPR for most processing records, and breach notification records retained per applicable regulation (periods vary by framework; litigation hold supersedes routine retention schedules when investigation is reasonably anticipated). Document your retention schedule against the specific frameworks governing your organization rather than applying a single blanket period.
AI Bill of Materials (AIBOM): A structured inventory of AI assets in a system, including models, training data, dependencies, and known limitations, exported in a machine-readable format such as CycloneDX for audit and supply chain transparency purposes.
SOC 2 Type II: An audit report issued by an accredited third party that tests the operating effectiveness of an organization's security controls over an observation period of 3 to 12 months, aligned to the AICPA Trust Services Criteria. Reports are typically considered current for 12 months from the end of the audit period.
ISO/IEC 42001: An international standard providing requirements for establishing and managing an AI management system, covering AI development lifecycle, risk management, documentation, and performance monitoring.
Data Processing Agreement (DPA): A legally binding contract between a data controller and a data processor that governs how the processor handles personal data, specifying purposes, retention, security obligations, and sub-processor requirements under GDPR Article 28.
Sub-processor: A third-party organization engaged by a vendor to process your data, including foundation model providers, cloud infrastructure, embedding services, and monitoring tools, each of which introduces downstream third-party risk that flows back to you as the contracting organization.
NIST AI RMF: The National Institute of Standards and Technology AI Risk Management Framework, organized around four functions (Govern, Map, Measure, Manage) that structure how organizations identify, assess, and respond to AI-related risks across their own systems and vendor relationships.