shipping production AI · since 2026 NAICS 541330 / 541511 / 541512 / 541519  ·  CMMC-aware
Refinery Report / AI Governance / post · rvices
AI GovernanceNIST AI RMFFinancial ServicesModel Risk Management

NIST AI RMF for Financial Services: A Practical Implementation Guide for Banks and Fintechs

How to implement NIST AI RMF in financial services: a practitioner guide that maps the four functions to SR 11-7 and audit-readiness.

D
By the DSE practice team
Operator-led practice · how we research & review
June 20, 2026
18 min · 3,859 words

By the DSE practice team · published June 20, 2026 · reviewed June 20, 2026

A model risk officer at a regional bank described the problem in one sentence. The board read about the NIST AI Risk Management Framework, the examiners asked how the bank’s AI use fit existing model risk guidance, and procurement started asking vendors for ISO 42001 certificates, and nobody could explain how the three connect. That confusion is the actual blocker for most banks and fintechs trying to govern AI, not a shortage of frameworks. There are too many frameworks and no map between them.

This guide draws that map for financial services. It explains what the NIST AI RMF four functions mean operationally for a bank, how they sit on top of the SR 11-7 model risk program you already run, where ISO/IEC 42001:2023 fits and where it does not, and what audit-readiness actually looks like in practice. We will keep the scope honest throughout. NIST AI RMF is voluntary and has no certification, so aligning to it supports readiness and shared language, not a compliance claim.

What NIST AI RMF means operationally for a bank

NIST AI RMF 1.0, published as NIST AI 100-1, is a voluntary framework for managing AI risk across the lifecycle. Its core is four functions, named exactly GOVERN, MAP, MEASURE, and MANAGE. For a bank, the value is not the framework’s abstractions but what each function forces you to produce: an owner, an artifact, and a decision. Read the four functions as four columns in your operating model, not four chapters in a document.

GOVERN is the connective tissue, and in a bank it lands on your existing risk and committee structure. It owns AI policy, accountability, roles, risk appetite, and oversight. The owner is typically the model risk management function reporting into the Chief Risk Officer, with board or risk-committee visibility. The artifacts GOVERN produces are the AI use policy, the governance committee charter and minutes, and the risk-tiering standard that says which AI systems get which level of scrutiny.

MAP establishes context and identifies risk in that context. For a bank, MAP is the AI inventory and the framing of each system: what it does, what data it touches, where it sits in a credit, fraud, or servicing decision, and who it affects. The owner is usually a shared responsibility between the business line that uses the model and model risk, because context requires both. The artifact MAP produces is a current, tiered AI inventory with data-provenance and use-context fields, which is the prerequisite for everything downstream.

MEASURE is the analytical function: testing, benchmarking, and monitoring. In a bank this is recognizable, because it is most of what validation and ongoing monitoring already do. MEASURE owns performance testing, bias and fair-lending testing, drift detection, and explainability assessment. The owner is independent model validation. The artifacts are validation reports, monitoring dashboards, and test results with defined thresholds.

MANAGE is the action function: prioritizing, responding, and recovering. It owns the response when MEASURE surfaces a problem, the decision to restrict or retire a model, and the documented treatment of accepted risks. The owner is again model risk in concert with the business. The artifact is the issue log, the remediation plan, and the disposition record for every flagged risk.

NIST AI RMF model risk management and the SR 11-7 overlap

Here is the most useful thing a bank can know: if you already run a credible SR 11-7 program, you are further along than the framework conversation suggests. SR 11-7, the joint Federal Reserve and OCC supervisory guidance on model risk management (Fed SR 11-7, OCC Bulletin 2011-12), already mandates the disciplines that MEASURE and MANAGE describe. Model development and implementation review, independent validation, ongoing monitoring, and governance are the SR 11-7 spine, and they map cleanly onto the two operational NIST functions.

The overlap is direct. SR 11-7 validation, with its emphasis on conceptual soundness, outcomes analysis, and ongoing monitoring, is MEASURE applied to models. SR 11-7 governance, with its insistence on roles, policies, and an inventory of models, is GOVERN and the inventory half of MAP. A bank doing this well does not need to rebuild anything to claim most of MEASURE and MANAGE. It needs to tag the work it already does to the NIST functions and find the gaps.

The gaps are where AI-specific risk lives, and they cluster in GOVERN and MAP. Traditional model risk management was built for statistical models with stable inputs and inspectable logic. It was not built for data provenance at the scale of foundation-model training sets, for drift in models that retrain or call external services, for explainability of systems that resist inspection, or for third-party foundation models you did not train and cannot fully see inside. These are the risks GOVERN and MAP add structure for, and they are exactly the places a 2013-era validation playbook goes quiet.

Third-party AI is the sharpest of these gaps, which is where OCC Bulletin 2013-29 comes in. OCC Bulletin 2013-29 is the supervisory guidance on third-party relationship risk management, and it governs the due diligence, contracting, and ongoing monitoring of vendors. Note that the OCC, Federal Reserve, and FDIC issued updated Interagency Third-Party Risk Management Guidance in June 2023 that is now frequently cited alongside or in place of 2013-29, though 2013-29 remains the named touchstone many programs still reference. For AI, third-party risk means foundation-model providers, AI-enabled SaaS, and the model supply chain, and it sits at the intersection of MAP (knowing the dependency exists) and GOVERN (having a policy for vetting it). For the broader treatment of these patterns in a bank context, see our work on banking AI governance.

How to implement NIST AI RMF in a finserv program

The mistake is starting with policy. A bank that opens with a GOVERN document writes rules for systems it has not inventoried and risks it has not measured, which produces a binder nobody can apply. The sequence that works inverts that instinct: establish context first, reuse what exists, then layer policy and testing onto a real picture.

Start with MAP and an inventory. You cannot govern, validate, or test an AI footprint you cannot see, and most banks underestimate theirs because AI now arrives bundled inside vendor products and wired into services with little ceremony. Build a current, tiered inventory that records each system, its owner, the data it handles, its place in a decision, and whether it was reviewed. This is the artifact every later step depends on.

Next, map the inventory onto your existing SR 11-7 governance. For each AI system, ask whether it is already a model under your model risk policy, and if so, what validation and monitoring it already receives. Most banks find that a meaningful share of their AI is already inside the model risk perimeter and already carries MEASURE and MANAGE evidence. The work here is reconciliation, not construction: connect what you have to the NIST functions and mark the systems that fall outside the existing perimeter.

Then layer GOVERN policy onto the reconciled picture. Now that you can see which systems exist and which are governed, write the AI use policy, the risk-tiering standard, and the committee charter against real systems rather than hypotheticals. Define risk appetite in terms your inventory can be measured against, and assign every high-tier system a named owner and an oversight cadence.

Build MEASURE for the AI-specific risks SR 11-7 validation does not fully cover. Extend your testing to drift detection for models that change over time, bias and fair-lending testing under ECOA and Regulation B, and explainability assessment for systems whose logic resists inspection. These are additive tests layered onto your validation discipline, not a parallel program. Finally, wire MANAGE: define the response path when a test fails, the authority to restrict or retire a system, and the documented disposition for every accepted risk, so findings drive action rather than accumulating in a report.

NIST AI RMF vs ISO 42001 and the EU AI Act

This is where most banks get the framing wrong, so be precise about it. US bank examiners supervise AI through the supervisory guidance you already answer to: SR 11-7 for model risk, third-party risk guidance for vendors, fair lending under ECOA and Regulation B, and prohibitions on unfair, deceptive, or abusive acts and practices. They do not supervise your AI through ISO/IEC 42001. ISO 42001 is real and valuable, but it is procurement and board assurance, not a substitute for your prudential exam posture.

ISO/IEC 42001:2023 is the AI management system standard, the AIMS standard. It follows the Annex SL management-system structure shared by ISO 27001, it is certifiable, and certification is granted by accredited third-party bodies after audit. That certifiability is exactly its use case. An ISO 42001 certificate is a credential you can show a customer, a partner, or a board, and a credential you can demand from a vendor in procurement. It signals a third-party-audited management system, which is genuine assurance value. What it does not do is satisfy a US prudential examiner, because that examiner is reading you against SR 11-7 and fair-lending law, not against an ISO standard.

The clean division of labor is this: NIST AI RMF plus SR 11-7 is your examiner-facing posture, and ISO 42001 is your procurement and board-assurance posture. They are complementary, not competing, and a large bank serving global customers may well run both. The EU AI Act is a third axis, binding only where you place AI systems on the EU market.

Dimension NIST AI RMF 1.0 (NIST AI 100-1) ISO/IEC 42001:2023 EU AI Act
Type and scope Voluntary US risk-management framework organized into four functions across the AI lifecycle Certifiable AI management system (AIMS) standard, Annex SL structure like ISO 27001 Binding EU regulation, risk-tiered into prohibited, high-risk, limited, and minimal
US regulatory alignment (examiner-facing?) Strongest US examiner alignment, via SR 11-7 model risk framing and existing supervisory guidance Indirect: procurement and board assurance, not a US prudential exam tool Not a US examiner tool, but extraterritorial for firms serving EU customers
Certification availability None. Voluntary framework, no certification program Certification available via accredited third-party bodies Conformity assessment and CE marking for high-risk systems, not ISO-style certification
Who it is for / primary use US firms organizing AI risk and building an examiner-facing posture Firms wanting a third-party-auditable AIMS, procurement signaling, and global assurance Firms placing AI systems on the EU market

A note on the EU AI Act timeline, stated honestly: its obligations phase in over 2025 to 2027, with high-risk obligations arriving on the later end of that window. We will not invent exact dates or article numbers here, because the precise applicability to a given system is the output of a real assessment, not a blog table.

What audit-readiness looks like in practice

Audit-readiness for a finserv AI program is not a posture you assert; it is a set of artifacts a reviewer can pick up and follow. The test is simple: when an examiner, internal auditor, or board committee asks how you govern a specific AI system, can you produce the evidence without scrambling? A ready program answers from files, not from memory. DSE prepares organizations to meet that bar; we do not certify and we do not guarantee any exam or audit outcome.

The artifacts that constitute readiness are concrete and mostly familiar to a model risk team. A current AI inventory tiered by risk. A documented model risk tiering standard that explains why each system sits where it does. Validation reports for in-scope models, with conceptual soundness, outcomes analysis, and limitations. Monitoring dashboards that show drift, performance, and threshold breaches over time. Governance committee minutes that record the decisions, the dissent, and the accepted risks.

Three artifacts deserve specific attention because they are where AI programs tend to be thin. Vendor due-diligence files for every third-party AI dependency, mapped to your third-party risk guidance, covering the foundation-model provider and the data terms. Fair-lending testing records for any AI touching credit decisions, documenting the disparate-impact analysis under ECOA and Regulation B. And an explainability assessment for high-tier systems, recording how the system’s behavior is understood and what compensating controls exist where it cannot be fully inspected.

The discipline that ties these together is the audit trail of decisions. A ready program does not just hold artifacts; it shows the chain from a risk being identified in MAP, measured in MEASURE, governed in GOVERN, and acted on in MANAGE. That traceability is what turns a pile of documents into a defensible posture, and it is what separates a program that prepares for examination from one that merely keeps files.

Document once, tag twice: layering onto SOC 2 and ISO 27001

Most banks and fintechs reading this already run a SOC 2 or ISO 27001 program, and the instinct to treat AI governance as a separate documentation effort is expensive and wrong. The principle that saves the most work is document once, tag twice: a control you already operate and evidence you already collect for SOC 2 or ISO 27001 can be tagged to a NIST AI RMF function and an ISO 42001 management-system clause without re-documenting it. You are not writing new controls; you are pointing existing ones at new frameworks.

Three examples make this concrete. Your access-control evidence, the user-access reviews and least-privilege records you already produce for SOC 2, tags directly to the MANAGE function for any AI system, because access scope is the blast radius of an AI tool. Your change-management evidence, the change-approval and deployment records you maintain for ISO 27001, tags to MEASURE and MANAGE for AI, because model deployment and retraining are changes that need the same control. Your vendor-management evidence, the due-diligence files you already keep, tags to MAP and GOVERN for AI and satisfies the third-party AI dependency requirement at the same time.

The payoff is that your AI governance program inherits a large share of its evidence from work already underway. The new work concentrates in the genuinely AI-specific places: drift monitoring, fair-lending testing, explainability, and foundation-model due diligence. Everything else is a tagging exercise against controls you operate today, which is the difference between a six-month program and an eighteen-month one.

What this guide is / What it is not

What it is: A practitioner implementation guide for layering NIST AI RMF onto an existing financial-services model risk program. It explains the four functions operationally, maps them to SR 11-7 and third-party risk guidance, places ISO 42001 and the EU AI Act accurately, and describes the artifacts that constitute audit-readiness. It is meant to give a Head of Model Risk, CDO, or CRO a defensible sequence and a shared language across risk, technology, and the board.

What it is not: It is not legal or regulatory advice, and it is not a certification, a conformity assessment, or evidence of compliance with any standard. NIST AI RMF is voluntary and has no certification program, so no document or mapping can certify alignment to it. ISO 42001 certification is real, but it is procurement and board assurance, not a substitute for a US prudential examination posture. DSE prepares organizations for audit and examination; we do not certify, and we do not guarantee any exam or audit outcome. Any vendor promising guaranteed regulatory approval is selling certainty that does not exist.

FAQ

What are the four functions of the NIST AI RMF? The NIST AI RMF 1.0, published as NIST AI 100-1, organizes AI risk management into four functions: GOVERN, MAP, MEASURE, and MANAGE. GOVERN sets policy, accountability, and oversight. MAP establishes context and identifies risk, including the AI inventory. MEASURE handles testing, benchmarking, and monitoring. MANAGE prioritizes, responds, and recovers. For a bank, each function maps to an owner, an artifact, and a decision rather than a chapter in a document.

How does NIST AI RMF relate to SR 11-7? SR 11-7, the joint Federal Reserve and OCC supervisory guidance on model risk management, already mandates the disciplines that the MEASURE and MANAGE functions describe: validation, outcomes analysis, ongoing monitoring, and governance. A bank running a credible SR 11-7 program typically has most of MEASURE and MANAGE covered. The GOVERN and MAP functions add structure for AI-specific risks that traditional model risk management was not built for, such as data provenance, drift, explainability, and third-party foundation models.

Does a US bank need ISO 42001 certification? No. US bank examiners supervise AI through SR 11-7, third-party risk guidance, fair lending under ECOA and Regulation B, and prohibitions on unfair, deceptive, or abusive practices, not through ISO/IEC 42001. ISO 42001 is a certifiable AI management system standard that is valuable for procurement signaling and board or third-party assurance, but it is not a substitute for a US prudential examination posture. The examiner-facing posture is built on NIST AI RMF plus SR 11-7.

Does implementing NIST AI RMF make a bank compliant or certified? No. NIST AI RMF is a voluntary framework with no certification program, so no document or implementation can certify alignment to it. Mapping an AI program to the four functions supports readiness, shared language, and a defensible examiner-facing posture. It is not a compliance claim or a conformity assessment. DSE prepares organizations for audit and examination but does not certify and does not guarantee any exam or audit outcome.

How do you implement NIST AI RMF in a financial services program? Start with MAP and a tiered AI inventory, because you cannot govern what you cannot see. Map that inventory onto your existing SR 11-7 governance to find what is already validated and monitored. Layer GOVERN policy onto the reconciled picture. Build MEASURE for AI-specific risks that SR 11-7 validation does not fully cover, including drift, fair-lending testing, and explainability. Then wire MANAGE so findings drive a documented response, restriction, or retirement rather than sitting in a report.

The Bottom Line

For a bank or fintech, the AI governance problem is not a shortage of frameworks but the absence of a map between them. NIST AI RMF plus SR 11-7 is your examiner-facing posture, ISO 42001 is your procurement and board-assurance posture, and the EU AI Act binds only where you serve the EU market. If you already run a credible SR 11-7 program, you hold most of MEASURE and MANAGE today, and the real work is concentrated in GOVERN, MAP, and the AI-specific testing that older model risk playbooks never required.

The sequence matters as much as the frameworks. Inventory first, reconcile against existing governance, layer policy onto real systems, extend testing to drift and fair lending and explainability, and tag the controls you already maintain for SOC 2 and ISO 27001 rather than re-documenting them. Done in that order, AI governance becomes a six-month layering exercise rather than a multi-year rebuild, and the output is a defensible, audit-ready posture expressed in language your examiners, your board, and your engineers all understand. That posture, and the work to reach it, is what an AI governance readiness engagement is built to deliver.


If you are layering NIST AI RMF onto an existing model risk program and want a head start on the artifacts, download the AI Governance Checklist for the inventory fields, tiering criteria, and readiness evidence a finserv program needs. When you are ready for a senior team to map your AI program to the four functions against your actual SR 11-7 governance and build the readiness posture, the AI Governance Readiness engagement does exactly that.

Key facts

Read next · AI Security & Governance

P
Founder · Principal Engineer
Data & AI engineer · 10+ yrs hands-on

Writes most of the long-form here. Lives in the codebase. Active on GitHub and LinkedIn.

§ Next step

Not sure which of these is you?

Tell us what's broken in a paragraph and a principal reads it directly — or walk the ladder from a low-commitment first engagement up to retained work.

One long-form a week. No marketing.

Subscribe to the Refinery Report. Practitioner deep-dives on AI engineering, security, and the realities of running production systems. Unsubscribe in one click.

~12 issues / quarter