Skip to main content

Concept

A dealer performance scorecard for Request for Quote (RFQ) participation is a foundational component of a modern trading operation’s analytical architecture. Its purpose is to systematically quantify and evaluate the quality of liquidity and service provided by each counterparty within a bilateral price discovery protocol. You have likely experienced the frustration of sending a quote solicitation into the ether, only to receive responses that are wide, slow, or, in some cases, nonexistent.

The scorecard moves this process from the realm of anecdotal experience and gut feeling into a rigorous, data-driven discipline. It is the mechanism by which an institution transforms its flow into a strategic asset, leveraging its own trading activity to refine its network of liquidity providers and enhance execution quality over time.

The core function of this system is to create a feedback loop. Every RFQ sent and every quote received is a data point. The scorecard is the system that captures, normalizes, and analyzes these points to generate actionable intelligence. It provides a quantitative basis for answering critical operational questions.

Which dealers are most responsive for a given asset class during specific market conditions? Who provides the most competitive pricing for odd-lot sizes versus block sizes? Which counterparties show a pattern of consistently improving on the market’s touch price? Without a structured evaluation framework, this information remains fragmented across individual trader’s memories or buried within unstructured chat logs. The scorecard centralizes this knowledge, creating an institutional memory that compounds in value with every trade.

A dealer scorecard transforms subjective counterparty assessment into an objective, data-centric operational discipline.

This system is built upon the principle that not all liquidity is equal. A quote is more than just a price; it is a collection of attributes that includes the speed of response, the size of the quote, its competitiveness relative to the prevailing market, and the reliability of the dealer in honoring that quote. The scorecard deconstructs each dealer’s participation into these constituent elements, weighs them according to the institution’s strategic priorities, and produces a composite view of performance.

This allows for a sophisticated and nuanced approach to managing dealer relationships, moving beyond the simple metric of which dealer won the most trades. It enables a trading desk to cultivate a panel of liquidity providers that is optimized for its specific flow profile and execution objectives, ensuring that for any given trade, the RFQ is being directed to the counterparties most likely to provide high-fidelity execution.


Strategy

The strategic implementation of a dealer performance scorecard is centered on aligning its metrics and evaluation criteria with the overarching goals of the trading desk. The system’s design must reflect a conscious decision about what constitutes “good” execution for your specific mandate, whether that is minimizing market impact for large orders, achieving maximum price improvement on liquid instruments, or ensuring certainty of execution in volatile products. A properly calibrated scorecard becomes a powerful tool for shaping dealer behavior and optimizing the entire liquidity sourcing process.

A polished, dark, reflective surface, embodying market microstructure and latent liquidity, supports clear crystalline spheres. These symbolize price discovery and high-fidelity execution within an institutional-grade RFQ protocol for digital asset derivatives, reflecting implied volatility and capital efficiency

Defining the Core Evaluation Framework

The initial step involves establishing the primary categories of performance that will be measured. These categories form the pillars of the scorecard, with individual Key Performance Indicators (KPIs) nested within them. A robust framework typically revolves around three strategic dimensions ▴ Price Competitiveness, Execution Quality, and Relationship & Service.

Each dimension addresses a different aspect of the dealer’s contribution to the trading process. This multi-faceted approach ensures a holistic evaluation, preventing the over-optimization of a single metric (like price) at the expense of others (like information leakage or settlement efficiency).

This framework must be communicated transparently to the dealer panel. When counterparties understand the criteria by which they are being measured, they are better equipped to tailor their service to meet those expectations. This fosters a collaborative dynamic where the scorecard serves as a shared benchmark for continuous improvement. The weighting assigned to each category should be dynamic, allowing the institution to adjust its priorities based on changing market regimes or strategic shifts.

A scratched blue sphere, representing market microstructure and liquidity pool for digital asset derivatives, encases a smooth teal sphere, symbolizing a private quotation via RFQ protocol. An institutional-grade structure suggests a Prime RFQ facilitating high-fidelity execution and managing counterparty risk

What Are the Primary Dimensions of Dealer Performance?

The three core dimensions provide a comprehensive view of a dealer’s value. Price Competitiveness focuses on the quantitative aspect of the quotes received. Execution Quality assesses the reliability and efficiency of the dealer’s process from quote to settlement. Finally, Relationship & Service captures the qualitative aspects that are crucial for complex trades and long-term partnership.

  • Price Competitiveness ▴ This dimension measures the dealer’s ability to provide advantageous pricing. It includes metrics that compare the dealer’s quote not only to other dealers in the auction but also to prevailing market benchmarks at the time of the request. The goal is to identify providers who consistently offer tight spreads and price improvement.
  • Execution Quality ▴ This pillar evaluates the efficiency and reliability of the dealer’s trading infrastructure and operational processes. High marks in this category indicate a dealer who is technologically integrated, responsive, and dependable, minimizing the operational risk and post-trade overhead for the institution.
  • Relationship & Service ▴ This qualitative dimension assesses the dealer’s overall partnership value. It considers their willingness to quote difficult trades, provide market color and analysis, and the efficiency of their sales and support teams. This is particularly important for less liquid instruments or complex, multi-leg strategies where dialogue and expertise are paramount.
An intricate, high-precision mechanism symbolizes an Institutional Digital Asset Derivatives RFQ protocol. Its sleek off-white casing protects the core market microstructure, while the teal-edged component signifies high-fidelity execution and optimal price discovery

Comparative Strategic Weighting

Different trading desks will have different priorities. A high-turnover quantitative fund might place a heavy emphasis on response latency and price, while a macro hedge fund executing large, complex derivatives trades might prioritize a dealer’s willingness to commit capital and provide insightful market commentary. The scorecard’s weighting system must be flexible enough to accommodate these diverse strategic needs.

Strategic alignment requires weighting scorecard metrics to reflect the specific execution priorities of the trading desk.

The table below illustrates how two different types of institutions might assign different weights to the core performance categories. This customization is what transforms the scorecard from a generic reporting tool into a tailored strategic instrument.

Performance Category Quantitative Hedge Fund Weighting Asset Manager (Long-Only) Weighting
Price Competitiveness 60% 40%
Execution Quality 30% 40%
Relationship & Service 10% 20%


Execution

The execution phase of a dealer scorecard project translates the conceptual framework and strategic objectives into a functioning operational system. This involves a granular definition of metrics, the establishment of a data capture and analysis pipeline, and the integration of the scorecard’s output into the daily workflow of the trading desk. This is where the architectural vision is realized through meticulous engineering and process design.

Sleek metallic system component with intersecting translucent fins, symbolizing multi-leg spread execution for institutional grade digital asset derivatives. It enables high-fidelity execution and price discovery via RFQ protocols, optimizing market microstructure and gamma exposure for capital efficiency

The Operational Playbook

Implementing a dealer scorecard is a systematic process that requires careful planning and cross-departmental coordination, primarily between trading, technology, and operations. The following steps provide a high-level playbook for building and deploying a robust evaluation system.

  1. Define Granular KPIs ▴ For each of the core performance categories (Price, Execution, Relationship), define specific, measurable, and objective Key Performance Indicators. This is the most critical step, as the validity of the entire system rests on the quality of its underlying metrics. A list of potential KPIs is detailed in the quantitative modeling section below.
  2. Establish Data Architecture ▴ Identify all necessary data sources. This will primarily be the institution’s own Execution Management System (EMS) or Order Management System (OMS), which captures RFQ messages, quote responses, and execution records. It may also include third-party market data feeds for benchmark pricing (e.g. composite feeds, exchange top-of-book). A centralized data warehouse or a dedicated analytics database is required to store and process this information.
  3. Develop Scoring and Weighting Logic ▴ Code the logic for calculating each KPI. For each metric, establish a scoring methodology (e.g. a 1-5 scale, or a percentile rank). Then, implement the weighting system that allows the desk to assign importance to each KPI and category, producing a single, composite score for each dealer.
  4. Build Reporting and Visualization Dashboards ▴ The output of the scorecard must be presented in an intuitive and actionable format. Develop dashboards that allow traders to view dealer rankings, drill down into specific KPIs, and analyze performance over time and across different asset classes or market conditions. The interface should support both high-level summaries and deep-dive analysis.
  5. Integrate and Automate ▴ The scorecard should not be a standalone, manually-run report. Its data feeds and calculations should be automated to run on a regular basis (e.g. daily or weekly). The insights from the scorecard should be integrated back into the pre-trade workflow. For example, the EMS could use scorecard data to automatically suggest a list of top-ranked dealers for a specific RFQ.
  6. Schedule Formal Review Sessions ▴ Establish a regular cadence (e.g. quarterly) for formal performance reviews with each dealer. These meetings should be data-driven, using the scorecard as the basis for discussion. This creates a transparent and objective forum for providing feedback, addressing issues, and strengthening the partnership.
The image displays a sleek, intersecting mechanism atop a foundational blue sphere. It represents the intricate market microstructure of institutional digital asset derivatives trading, facilitating RFQ protocols for block trades

Quantitative Modeling and Data Analysis

The heart of the scorecard is its quantitative engine. Each metric must be precisely defined with a clear formula and data source. The table below details a set of core KPIs that form a comprehensive basis for evaluation. The formulas provided are illustrative and can be adapted based on the specific data available within an institution’s systems.

KPI Category Formula / Definition Data Sources Strategic Purpose
Hit Rate Execution (Number of RFQs Responded To / Number of RFQs Sent to Dealer) 100 EMS/OMS RFQ Logs Measures dealer’s willingness to quote and engage.
Win Rate Price (Number of RFQs Won by Dealer / Number of RFQs Responded To by Dealer) 100 EMS/OMS Execution Records Indicates how often a dealer’s quote is the most competitive.
Price Improvement Price (Benchmark Price – Executed Price) Trade Size. Can be aggregated or averaged. Execution Records, Market Data Feed Quantifies the value added versus a market benchmark (e.g. arrival mid-price).
Response Latency Execution Average time (in milliseconds or seconds) between RFQ Sent timestamp and Quote Received timestamp. EMS/OMS RFQ Logs Measures technological speed and attentiveness of the dealer.
Quoted Spread Price Average of (Dealer’s Offer – Dealer’s Bid) for two-sided quotes. EMS/OMS RFQ Logs Measures the competitiveness of a dealer’s pricing independent of winning the trade.
Information Leakage Signal Execution Correlation between a dealer’s quote submission and adverse market movement prior to execution. Requires advanced statistical analysis. RFQ Logs, High-Frequency Market Data Identifies potential signaling risk associated with quoting to a specific dealer.
Settlement Failure Rate Relationship (Number of Failed Settlements / Total Number of Trades) 100 Operations/Settlements System Measures post-trade operational reliability.
A precise digital asset derivatives trading mechanism, featuring transparent data conduits symbolizing RFQ protocol execution and multi-leg spread strategies. Intricate gears visualize market microstructure, ensuring high-fidelity execution and robust price discovery

Predictive Scenario Analysis

Consider the challenge faced by a portfolio manager at a large asset management firm who needs to execute a block trade of $50 million in a specific corporate bond. The bond is relatively liquid but a trade of this size could still cause significant market impact if not handled with care. The firm’s trading desk utilizes a sophisticated dealer scorecard system integrated directly into their EMS.

The portfolio manager’s primary objective is to minimize slippage relative to the arrival price while ensuring a high fill rate. The head trader consults the scorecard’s pre-trade analytics module to construct an optimal RFQ strategy.

The system first filters the firm’s dealer panel for counterparties who have demonstrated strong performance in investment-grade corporate bonds of similar duration and credit quality over the past six months. This initial filter narrows the list from 25 potential dealers down to 12. Next, the trader applies a strategic weighting to the scorecard model that prioritizes Price Improvement (40% weight), Information Leakage Signal (30% weight), and Hit Rate (20% weight), with the remaining 10% allocated to Response Latency. The system is being told that price is important, but preventing the market from moving against the trade is a very close second priority.

The scorecard dashboard populates with the re-weighted rankings. Dealer A, a large bulge-bracket bank, scores highest overall. Their Price Improvement metric is consistently in the top quartile, and their Response Latency is measured in sub-seconds due to heavy investment in API-driven quoting. Critically, their Information Leakage Signal is very low, indicating their quoting and hedging activity is well-internalized and does not tend to spook the market.

Dealer B, a specialized fixed-income house, ranks second. Their Price Improvement is slightly lower than Dealer A’s, but their Hit Rate for requests over $25 million is 95%, demonstrating a strong appetite for large-size risk. Their Information Leakage score is also excellent.

Dealer C, another large bank, ranks third. While their raw Win Rate is historically high, the scorecard reveals a potential issue. The trader drills down into Dealer C’s metrics and sees that their Information Leakage Signal is significantly higher than the others. The system’s notes, compiled from past trader observations, indicate that on several occasions, large RFQs sent to Dealer C have been followed by a noticeable fade in the market’s bid price before the trade was completed.

This suggests that Dealer C’s own trading or hedging activity may be more transparent to the broader market, creating signaling risk. Without the scorecard, the trader might have included Dealer C based on their strong historical relationship and high win rate. With the data-driven insight, the trader decides to exclude Dealer C from this specific, sensitive inquiry.

Actionable intelligence from a scorecard allows for the dynamic construction of an RFQ panel optimized for a specific trade’s objectives.

The trader constructs a “wave” of RFQs. The first wave is sent to the top four dealers identified by the scorecard ▴ Dealer A, Dealer B, and two others with similar profiles. The RFQ is sent for a partial amount, $25 million, to test the waters. The responses come back within the expected timeframes.

Dealer A provides the best quote, and the trader executes with them. The post-trade analytics module automatically captures the execution details and updates the scorecard’s underlying data. The system calculates the price improvement for this fill at +$0.02 per bond against the arrival mid-price, a tangible saving for the fund. For the second wave, to complete the order, the trader includes Dealer B and two others, again leveraging the scorecard’s data to select the counterparties most likely to provide competitive pricing without creating adverse selection.

This systematic, data-informed process, repeated across thousands of trades, results in a measurable improvement in overall execution quality for the firm, directly contributing to fund performance. The scorecard, in this context, is the central nervous system of the execution process.

A sleek, dark metallic surface features a cylindrical module with a luminous blue top, embodying a Prime RFQ control for RFQ protocol initiation. This institutional-grade interface enables high-fidelity execution of digital asset derivatives block trades, ensuring private quotation and atomic settlement

System Integration and Technological Architecture

A central metallic mechanism, representing a core RFQ Engine, is encircled by four teal translucent panels. These symbolize Structured Liquidity Access across Liquidity Pools, enabling High-Fidelity Execution for Institutional Digital Asset Derivatives

How Should the Scorecard Integrate with Existing Trading Systems?

The dealer scorecard cannot exist in a vacuum. Its value is maximized when it is deeply integrated into the firm’s existing trading technology stack. The core principle of this integration is the seamless flow of data from the point of action (the EMS/OMS) to the point of analysis (the scorecard engine) and back to the point of action (pre-trade decision support).

The primary integration point is with the Execution Management System. The EMS is the source of the raw event data that fuels the scorecard. This includes every RFQ message sent, every quote received, every execution, and every cancellation.

A robust API connection is required to stream this data in real-time or near-real-time into the scorecard’s database. This data must be captured in a structured format, including security identifiers, timestamps (with millisecond precision), dealer names, quote sizes, prices, and trade details.

Secondly, the scorecard system needs access to a reliable market data source to calculate metrics like Price Improvement. This involves connecting to a consolidated data feed that can provide a benchmark price (e.g. top-of-book, volume-weighted average price, or a composite price from multiple venues) for any given instrument at a specific point in time. The architecture must be able to query this market data source with a historical timestamp to retrieve the benchmark price that was valid at the moment an RFQ was sent or an execution occurred.

Finally, the intelligence generated by the scorecard must be fed back into the EMS to assist traders in their decision-making process. This “closing of the loop” is the most advanced stage of integration. For example, when a trader initiates an RFQ for a particular instrument, the EMS can make an API call to the scorecard system.

The scorecard, based on its historical data and the characteristics of the current order (size, asset class, etc.), can return a ranked list of suggested dealers. This decision support tool empowers the trader with data-driven recommendations directly within their workflow, augmenting their own market knowledge and improving the efficiency and quality of the dealer selection process.

A precise geometric prism reflects on a dark, structured surface, symbolizing institutional digital asset derivatives market microstructure. This visualizes block trade execution and price discovery for multi-leg spreads via RFQ protocols, ensuring high-fidelity execution and capital efficiency within Prime RFQ

References

  • Harris, Larry. “Trading and Exchanges Market Microstructure for Practitioners.” Oxford University Press, 2003.
  • O’Hara, Maureen. “Market Microstructure Theory.” Blackwell Publishers, 1995.
  • Madhavan, Ananth. “Market Microstructure ▴ A Survey.” Journal of Financial Markets, vol. 3, no. 3, 2000, pp. 205-258.
  • Bessembinder, Hendrik, and Herbert M. Spilker. “Transaction Costs and Trading Behavior in the Junk Bond Market.” The Journal of Finance, vol. 54, no. 4, 1999, pp. 1471-1496.
  • Chordia, Tarun, Richard Roll, and Avanidhar Subrahmanyam. “Order Imbalance, Liquidity, and Market Returns.” Journal of Financial Economics, vol. 65, no. 1, 2002, pp. 111-140.
  • “MiFID II Best Execution Reporting ▴ A Practitioner’s Guide.” Financial Conduct Authority, 2017.
  • “Best Execution in Fixed Income.” Securities Industry and Financial Markets Association (SIFMA), White Paper, 2021.
A sophisticated mechanism depicting the high-fidelity execution of institutional digital asset derivatives. It visualizes RFQ protocol efficiency, real-time liquidity aggregation, and atomic settlement within a prime brokerage framework, optimizing market microstructure for multi-leg spreads

Reflection

The construction of a dealer performance scorecard is an exercise in institutional self-awareness. It forces a trading desk to move beyond legacy relationships and anecdotal evidence, and to define, with quantitative precision, what it values in its liquidity partners. The process of building this system is as valuable as the output itself, as it necessitates a deep examination of your firm’s own execution objectives and operational capabilities. The completed scorecard is more than a reporting tool; it is a dynamic representation of your firm’s place within its market ecosystem.

It provides a mirror to your own flow and a lens through which to view your counterparties. The ultimate question this system poses is not just “Who are our best dealers?” but “How can we leverage this intelligence to systematically improve every single execution?” The answer to that question is the foundation of a durable competitive edge.

A sophisticated metallic and teal mechanism, symbolizing an institutional-grade Prime RFQ for digital asset derivatives. Its precise alignment suggests high-fidelity execution, optimal price discovery via aggregated RFQ protocols, and robust market microstructure for multi-leg spreads

Glossary

A precisely engineered central blue hub anchors segmented grey and blue components, symbolizing a robust Prime RFQ for institutional trading of digital asset derivatives. This structure represents a sophisticated RFQ protocol engine, optimizing liquidity pool aggregation and price discovery through advanced market microstructure for high-fidelity execution and private quotation

Dealer Performance Scorecard

Meaning ▴ A Dealer Performance Scorecard, in the context of institutional crypto trading and request-for-quote (RFQ) systems, is a structured analytical tool used to quantitatively evaluate the effectiveness and quality of liquidity provision by market makers or dealers.
Intersecting metallic components symbolize an institutional RFQ Protocol framework. This system enables High-Fidelity Execution and Atomic Settlement for Digital Asset Derivatives

Rfq

Meaning ▴ A Request for Quote (RFQ), in the domain of institutional crypto trading, is a structured communication protocol enabling a prospective buyer or seller to solicit firm, executable price proposals for a specific quantity of a digital asset or derivative from one or more liquidity providers.
A sleek metallic teal execution engine, representing a Crypto Derivatives OS, interfaces with a luminous pre-trade analytics display. This abstract view depicts institutional RFQ protocols enabling high-fidelity execution for multi-leg spreads, optimizing market microstructure and atomic settlement

Execution Quality

Meaning ▴ Execution quality, within the framework of crypto investing and institutional options trading, refers to the overall effectiveness and favorability of how a trade order is filled.
A translucent sphere with intricate metallic rings, an 'intelligence layer' core, is bisected by a sleek, reflective blade. This visual embodies an 'institutional grade' 'Prime RFQ' enabling 'high-fidelity execution' of 'digital asset derivatives' via 'private quotation' and 'RFQ protocols', optimizing 'capital efficiency' and 'market microstructure' for 'block trade' operations

Trading Desk

Meaning ▴ A Trading Desk, within the institutional crypto investing and broader financial services sector, functions as a specialized operational unit dedicated to executing buy and sell orders for digital assets, derivatives, and other crypto-native instruments.
Precision instruments, resembling calibration tools, intersect over a central geared mechanism. This metaphor illustrates the intricate market microstructure and price discovery for institutional digital asset derivatives

Dealer Performance

Meaning ▴ Dealer performance quantifies the efficacy, responsiveness, and competitiveness of liquidity provision and trade execution services offered by market makers or institutional dealers within financial markets, particularly in Request for Quote (RFQ) environments.
Intersecting teal and dark blue planes, with reflective metallic lines, depict structured pathways for institutional digital asset derivatives trading. This symbolizes high-fidelity execution, RFQ protocol orchestration, and multi-venue liquidity aggregation within a Prime RFQ, reflecting precise market microstructure and optimal price discovery

Liquidity Sourcing

Meaning ▴ Liquidity sourcing in crypto investing refers to the strategic process of identifying, accessing, and aggregating available trading depth and volume across various fragmented venues to execute large orders efficiently.
A vibrant blue digital asset, encircled by a sleek metallic ring representing an RFQ protocol, emerges from a reflective Prime RFQ surface. This visualizes sophisticated market microstructure and high-fidelity execution within an institutional liquidity pool, ensuring optimal price discovery and capital efficiency

Information Leakage

Meaning ▴ Information leakage, in the realm of crypto investing and institutional options trading, refers to the inadvertent or intentional disclosure of sensitive trading intent or order details to other market participants before or during trade execution.
A futuristic apparatus visualizes high-fidelity execution for digital asset derivatives. A transparent sphere represents a private quotation or block trade, balanced on a teal Principal's operational framework, signifying capital efficiency within an RFQ protocol

Price Improvement

Meaning ▴ Price Improvement, within the context of institutional crypto trading and Request for Quote (RFQ) systems, refers to the execution of an order at a price more favorable than the prevailing National Best Bid and Offer (NBBO) or the initially quoted price.
Abstract geometric forms, symbolizing bilateral quotation and multi-leg spread components, precisely interact with robust institutional-grade infrastructure. This represents a Crypto Derivatives OS facilitating high-fidelity execution via an RFQ workflow, optimizing capital efficiency and price discovery

Response Latency

Meaning ▴ Response Latency, within crypto trading systems, quantifies the time delay between the initiation of an action, such as submitting an order or a Request for Quote (RFQ), and the system's corresponding reaction, like an order confirmation or a definitive price quote.
Intersecting abstract geometric planes depict institutional grade RFQ protocols and market microstructure. Speckled surfaces reflect complex order book dynamics and implied volatility, while smooth planes represent high-fidelity execution channels and private quotation systems for digital asset derivatives within a Prime RFQ

Dealer Scorecard

Meaning ▴ A Dealer Scorecard is an analytical tool employed by institutional traders and RFQ platforms to systematically evaluate and rank the performance of market makers or liquidity providers.
A high-fidelity institutional digital asset derivatives execution platform. A central conical hub signifies precise price discovery and aggregated inquiry for RFQ protocols

Order Management System

Meaning ▴ An Order Management System (OMS) is a sophisticated software application or platform designed to facilitate and manage the entire lifecycle of a trade order, from its initial creation and routing to execution and post-trade allocation, specifically engineered for the complexities of crypto investing and derivatives trading.
A polished glass sphere reflecting diagonal beige, black, and cyan bands, rests on a metallic base against a dark background. This embodies RFQ-driven Price Discovery and High-Fidelity Execution for Digital Asset Derivatives, optimizing Market Microstructure and mitigating Counterparty Risk via Prime RFQ Private Quotation

Market Data

Meaning ▴ Market data in crypto investing refers to the real-time or historical information regarding prices, volumes, order book depth, and other relevant metrics across various digital asset trading venues.
A sleek, futuristic object with a glowing line and intricate metallic core, symbolizing a Prime RFQ for institutional digital asset derivatives. It represents a sophisticated RFQ protocol engine enabling high-fidelity execution, liquidity aggregation, atomic settlement, and capital efficiency for multi-leg spreads

Scorecard System

Meaning ▴ A Scorecard System is a structured performance management tool that evaluates entities or processes against a predefined set of criteria and key performance indicators (KPIs).
A precisely engineered multi-component structure, split to reveal its granular core, symbolizes the complex market microstructure of institutional digital asset derivatives. This visual metaphor represents the unbundling of multi-leg spreads, facilitating transparent price discovery and high-fidelity execution via RFQ protocols within a Principal's operational framework

Information Leakage Signal

A tick size reduction elevates the market's noise floor, compelling leakage detection systems to evolve from spotting anomalies to modeling systemic patterns.