Impact of AML Scoring Granularity on Elliptic++ Graph Analysis

Do Transaction-Level and Actor-Level AML Queues Agree? An Empirical Evaluation of Granularity Effects on the Elliptic++ Graph

In recent developments within the field of anti-money laundering (AML), a new paper has been released that offers a comprehensive examination of how different scoring granularities can influence investigation outcomes in blockchain networks. Titled “Do Transaction-Level and Actor-Level AML Queues Agree? An Empirical Evaluation of Granularity Effects on the Elliptic++ Graph,” this study aims to bridge the gap between transaction-level and actor-level analyses, providing critical insights for compliance actions.

Published on arXiv, the paper highlights the dual granularity levels at which graph-based AML systems can assess suspicious activities: transaction-level and actor address-level. While compliance measures are taken at the actor level, the scoring system can vary significantly based on the granularity applied, leading to potential discrepancies in investigation queue compositions.

Methodology Overview

The authors introduce an innovative evaluation methodology designed to measure the impact of scoring granularity on investigation queues under fixed review budgets. This methodology is formalized through a projection framework that maps transaction-level scores to actor-level action units using four distinct aggregation operators. Additionally, the paper presents novel budgeted investigation metrics, including:

Yield@Budget: A metric to assess the efficiency of the review process.
Burden Decomposition: An analysis of the workload distribution across different queues.
Case Fragmentation: A measure of how cases are split among different investigation queues.

Utilizing the public Elliptic++ Bitcoin dataset, which comprises 203,769 transactions and 822,942 address occurrences, the researchers trained independent random forest classifiers for both transaction-level and actor-level analyses. They employed a causal temporal protocol to ensure the integrity of their evaluations, comparing review queues through metrics such as Jaccard overlap, burden decomposition, and feature-matching ablations.

Key Findings

The empirical results of the study reveal significant variances in queue compositions based on the chosen scoring granularity. At a one-percent budget, the temporal evaluation yielded a mean Jaccard overlap of 0.374 (SD 0.171), whereas the static pooled evaluation produced a considerably lower overlap of 0.087 (95% CI [0.079, 0.094]). This discrepancy highlights the limitations of static models in capturing the dynamic nature of illicit activity.

Interestingly, when an enriched address model utilizing all 237 features was applied, the overlap diminished even further, yielding a Jaccard score of 0.051. The research indicates that only 4.3% of reviewed cases were identified as illicit per 100 reviews, a stark contrast to the 30.2% illicit detection rate observed in the transaction-projected queue.

Moreover, the study found that address-level detection efficacy was highly temporally concentrated, with two timesteps exceeding 91% illicit detection per 100 reviews, while the static burden exhibited a mere 3.4%. The authors also note that a fixed hybrid policy underperformed compared to the best single-level queue by 5.05 percentage points (CI [-10.2pp, -0.9pp]).

Conclusion

The findings presented in this paper underscore the importance of scoring granularity as a critical design variable within AML investigation systems. The research reveals that using the same data and budget can result in vastly different queues and different addresses being investigated, suggesting a need for regulators and practitioners to carefully consider the implications of their chosen scoring methodology.

As the landscape of blockchain technology and financial crime evolves, such insights will be invaluable for enhancing the effectiveness of AML strategies and ensuring compliance with regulatory requirements.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Impact of AML Scoring Granularity on Elliptic++ Graph Analysis

Do Transaction-Level and Actor-Level AML Queues Agree? An Empirical Evaluation of Granularity Effects on the Elliptic++ Graph

Methodology Overview

Key Findings

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related