Datasets
Classification
Classification is the core analytical engine of SpendCraft. It systematically organizes raw transaction data into a standardized, hierarchical structure, converting individual purchase lines into actionable spend categories.
π― The Goal of Classification
The primary objective is to assign every line item in a Dataset to a precise, multi-level category path defined by your Taxonomy.
| Stage | Data Example |
|---|---|
| Input | "Invoice for 50 licenses of Adobe Photoshop, Dept MKTG" |
| Process | Transaction analyzed by the AI Agent using Vendor & Description. |
| Output | Indirect Spend $\rightarrow$ IT $\rightarrow$ Software $\rightarrow$ Creative Software |
This standardization allows you to compare spend across different business units and geographies using a single, unified lens.
π€ The Classification Process
Classification is an automated workflow driven by the AI Agent:
- AI Agent Selection: You select a pre-trained Agent containing rules learned from previously verified data.
- Field Analysis: The Agent analyzes textual fields like Vendor Name, Transaction Description, and GL Codes.
- Pattern Mapping: The Agent maps the transaction to the most appropriate taxonomy category based on its training.
- Confidence Scoring: Each result receives a score. Low-confidence classifications (e.g., below 80%) are flagged for manual audit.
π Continuous Learning Loop
Classification accuracy improves over time through a built-in feedback mechanism:
- Auditing & Correction: Analysts review the results. If a "Laptop" was incorrectly classified as "Office Supplies," they manually correct it to "IT Hardware."
- Model Refinement: SpendCraft captures these corrections and feeds them back into the AI Agent's training data.
- Performance Impact: The Agent updates its internal logic, ensuring it handles similar future transactions correctly without human intervention.
π Strategic Importance
- Benchmarking: Compare category spend accurately against industry standards.
- Category Management: Provide managers with a 360-degree view of total addressable spend (e.g., all "Cloud Hosting" spend regardless of the vendor).
- Data Consistency: Ensure every department uses the same definitions and language for expenses.