SHARE

April 02, 2026

Building a transaction foundation model to power intelligent finance

Raghu Chetlapalli

Raghu Chetlapalli
Product Manager

Wen Yao

Wen Yao
Engineering Manager

As financial services move beyond static products toward dynamic, context-aware systems, the infrastructure needs a deeper understanding of financial data itself. With deeper context, budgeting tools can anticipate cash flow gaps before they occur, risk models recalibrate as spending patterns shift, and fraud detection adapts in real-time to emerging threats. 

We call this shift intelligent finance: systems that don't merely record transactions, but interpret them well enough to act on them.

As our customers build more adaptive financial products, they need richer context and stronger signals that generalize across use cases. Layering general-purpose AI on top of financial data isn't enough. What's needed is a shared, scalable representation of financial activity, one that works across institutions, products, and use cases. That's why Plaid built a transaction foundation model for finance.

Establishing a shared representation of financial activity

The foundation for intelligent finance is financial data, and transactions are one of the key data points for understanding financial behavior. Each one is a compact record of economic activity: who was paid, when, where, and how much. But raw transaction data is messy and inconsistent. The same merchant can appear in dozens of string variations, and the same string can mean different things depending on context and shifting metadata.

To make this data usable, we’ve spent years building and refining our enrichment pipeline that standardizes and adds context to transactions across institutions. This wasn’t a one-time solution, but required multiple iterations to resolve ambiguity, incorporate context, and move from surface-level cleaning to deeper understanding of financial activity. This work laid the groundwork for a more generalizable approach with our transaction foundation model.

Our transaction foundation model is trained on large-scale, anonymized transaction data across the Plaid network using self-supervised learning. This approach changes how we think about transaction intelligence in three ways: 

  1. Introducing context: Transactions are interpreted in relation to relevant signals, not as isolated text entries. 

  2. Strengthening generalization: The model handles unfamiliar formats by drawing on learned semantic patterns rather than brittle rule sets. 

  3. Creating a shared backbone: This shared representation supports multiple downstream tasks such as entity recognition, merchant normalization, categorization, semantic search, and risk signaling with minimal additional adaptation.

The result is a shift from fragmented, manually engineered systems to a single scalable model where improvements compound across every product it powers.

Plaid’s unique position in enabling intelligent finance

Building a robust foundation model for transactions requires both breadth and depth in data, spanning diverse institutions, merchant formats, geographies, account types, and product types to capture how financial activity actually behaves across the ecosystem. 

Plaid operates one of the largest user-permissioned financial data networks, covering thousands of financial institutions and a broad ecosystem of developers and financial partners. With more than a decade of working with transaction data, we’ve developed deep expertise in how financial activity is represented across contexts, and how the same economic event can look entirely different depending on the institution or format reporting it.

This cross-institutional perspective enables the model to learn structural patterns rather than institution-specific quirks. It allows us to build neutral infrastructure that improves multiple capabilities and APIs at once. For example, an improvement to merchant identity resolution simultaneously sharpens categorization accuracy and downstream risk signals.

While closed-loop platforms optimize for a single surface, Plaid’s purpose-built infrastructure enables every model improvement to translate into better accuracy, stronger signals, and less work for our customers. This is what makes truly context-aware, adaptive financial experiences possible.

How our transaction foundation model works at a high level

We trained a domain-specific encoder to learn semantic representations of transactions beyond surface-level text. To do this, we frame the problem as contrastive learning. Positive pairs reflect transactions with the same underlying financial meaning, while hard negatives capture cases that look similar but represent different economic events. The model learns to group truly equivalent transactions together, organizing the embedding space around financial intent rather than lexical similarity.

The result is a representation where transactions with the same financial meaning are placed together. This shared foundation can then power downstream tasks such as classification, entity resolution, pattern detection, and semantic retrieval across highly variable transaction descriptions.

What this unlocks for financial applications

Our approach is to treat the shared transaction representation as core infrastructure, then layer focused capabilities on top through lightweight adaptation. In practice, this leads to better end-user experiences with more accurate enrichments, personalized insights, and tools that better reflect how people actually manage their money.

This adaptation layer allows the same underlying representation to power multiple capabilities without rebuilding models for each use case. Because each task draws from the same representation, improvements at the foundation level compound across downstream systems. This shared backbone also enables the next generation of models we are developing, supporting more advanced behavioral modeling and financial reasoning without duplicating core learning.

The impact of this model is already measurable across Plaid products. For example, we’re already seeing strong accuracy gains across downstream tasks. 

  • Income classification improved by 48%, making it significantly better at identifying a person’s income.

  • Loan payment detection improved by 14%, increasing reliability in identifying on-time payments, a key signal of repayment ability. 

  • Bank fee classification improved by 22%. This matters because fees often act as early indicators of risk. Overdraft and NSF fees can signal financial strain, late fees reflect missed payments, and unusual charges like wire or foreign transaction fees may point to potential fraud.

The model captures deeper economic signals that help it disambiguate merchants operating across multiple verticals and handle edge cases that would trip up simpler models. Categorization reflects intent, not just matching keywords.

From feature engineering to shared infrastructure

Historically, launching a new financial intelligence feature meant designing bespoke pipelines, labeling new datasets, training a dedicated model, and standing up separate infrastructure—all from scratch, every time. Each capability evolved in isolation.

A foundation model changes that equation. The embedding layer becomes shared infrastructure. New use cases are built through lightweight adaptation rather than full reimplementation, and improvements to the core representation automatically propagate across every product it supports.

Deploying this in financial services demands operational rigor. Inference must meet low-latency requirements for real-time APIs. The system must remain highly reliable, cost-efficient at scale, handle long transaction histories, and generalize across regions with different payment norms including sparse or newly linked accounts where data is limited.

What’s next

A transaction foundation model captures the semantics of individual economic events, but financial behavior is inherently sequential. Spending, income, and transfers form patterns over time that encode intent, constraints, and risk in ways a single transaction cannot capture.

Our next step is to move from isolated embeddings to sequence foundation models that learn the structure of financial activity across time. By modeling temporal dependencies and recurring behaviors, we aim to represent not just what happened, but how financial lives evolve.

If you're building AI-powered financial experiences, we’d love to connect and explore how Plaid can support your team.