Deterministic Workflow for Accurate HS Tariff Classification

Date:

A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions

The recent paper titled “A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions” presents a novel approach to one of the most complex tasks in international trade: Harmonized System (HS) tariff classification. The study, available on arXiv under the identifier 2605.14857v1, addresses the challenges faced by existing systems in accurately mapping product descriptions to specific tariff codes using a structured, interpretable workflow.

HS tariff classification requires an expert-level understanding of intricate rules that govern the assignment of six- or eight-digit codes. These rules are outlined under the General Interpretive Rules (GIR) and supplemented by section notes, chapter notes, and Explanatory Notes. The primary challenge in this task is not merely the volume of knowledge required but the complexity of *multi-dimensional rule reasoning*. This involves balancing competing priority rules across various criteria such as material composition, product form, functional use, essential character, and the distinction between parts and wholes.

The Shortcomings of Current Approaches

One significant limitation of current automated systems is their reliance on end-to-end prompting of large language models (LLMs). While these models can process vast amounts of information, they often fall short by resolving only one aspect of classification while neglecting others, leading to inaccuracies in decision-making.

A New Workflow Model

The authors propose a *deterministic agentic workflow* as a solution. This framework contrasts with self-planning agents by establishing a fixed control flow. Key features of the proposed model include:

  • Narrowly Confined Language Model Calls: The workflow limits language model interactions to specific stages, minimizing the risk of misinterpretation.
  • Local Reflection and Verification: Each stage incorporates mechanisms for reflection and verification, enhancing the reliability of the outputs.
  • Structured Outputs: Decisions are broken down into structured outputs that provide verbatim citations from relevant chapter or section notes, thereby improving interpretability.

The combination of offline knowledge-engineering, particularly of the Chinese HS tariff, with an online six-stage pipeline makes this model particularly robust. The authors have evaluated their workflow using HSCodeComp at the six-digit level, achieving notable results.

Performance Metrics

The workflow’s performance metrics are impressive:

  • 75.0% Top-1 and 91.5% Top-3 Accuracy: Achieved at the four-digit level.
  • 64.2% Top-1 and 78.3% Top-3 Accuracy: Achieved at the six-digit level.

Furthermore, the Qwen3.6-plus backbone in non-thinking mode produced an 84.2% agreement at the four-digit level and 77.4% at the six-digit level, indicating strong alignment with frontier models.

Implications for the Future

A two-stage manual audit of 226 six-digit disagreements revealed that a significant portion of the HSCodeComp ground-truth labels may not align with established HS general rules. This finding raises critical questions about the accuracy of existing classifications and suggests the need for further community review and discussion.

In conclusion, this research underscores the importance of structured, interpretable approaches in high-stakes classification tasks. The findings and methodologies presented promise to enhance the accuracy and reliability of HS tariff classifications, paving the way for more effective international trade practices.

Related AI Insights

Lazarus Omolua
Lazarus Omoluahttps://richlyai.com/blog
My mission is to make sure that people in Africa are not left behind in the global AI revolution. RichlyAI exists to give everyone — students, founders, creators, and businesses — the tools to compete globally.

Subscribe

Popular

More like this
Related

How Business Ops Teams Boost Productivity with Codex

Discover how business operations teams use Codex to streamline documentation, enhance collaboration, and improve decision-making with AI-powered automation...

OpenAI Partners with Malta to Offer ChatGPT Plus Nationwide

OpenAI and Malta team up to provide free ChatGPT Plus access and AI training to all citizens, promoting digital literacy and responsible AI use.

Critical Linux Kernel Flaw Risks SSH Host Key Theft

A critical Linux kernel flaw risks stolen SSH host keys. Learn how to protect your systems and stay secure until patches are widely available.

Top External Hard Drives 2026: Expert Reviews & Buying Guide

Discover the best external hard drives of 2026 with expert reviews. Find top picks for speed, durability, and security to suit all storage needs.