Collatio

Collatio

Providing accurate, reliable data & its lineage

Features

BACKGROUND

Resolution of out-of-date data, duplicates, near duplicates & incomplete/missing data (continually) as new data on customers, entities, transactions expands dynamically

KEY FEATURES

  • Matches and finds similar records across different data sources
  • De-duplicates “similar” records using fuzzy matching
  • Harmonizes names & addresses and validates with external sources
  • Enriches data with 30+ attributes e.g. URL, parent company, key contacts
  • Identifies “best match” pair from different data sources
  • Detects & fixes errors via ensemble of proprietary ML & NLP algorithms

BACKGROUND

Data from multiple sources moves through several systems & is transformed, which leads to non-traceability of data’s origins, transformations & errors

KEY FEATURES

  • Identifies data lineage via functional and format transformations
  • Time-series graphs to show parameters & transformations at each “hop”
  • Pre-built library of (500+) transformation rules for various data types
  • Proprietary ML & NLP algorithms to “reverse engineer” business rules
  • Generates exceptions and alerts on incremental data

BACKGROUND

  • Financial & operating documents (e.g., balance sheets, income statements, rent rolls, fund & tax statements) are often available in PDF, JPEG or paper formats
  • Firms manually digitize them, which is expensive to scale & results in errors

KEY FEATURES

  • ML, NLP and graph-based solution to “reverse engineer” formulas
  • 40+ pre-trained proprietary algorithms to digitize, validate values & fix errors
  • Reinforcement learning on “its own” and via a “human loop”
  • Pre-built financial ontology & business rules
  • Output maps to required financial templates
  • Points out specific areas where the analyst should review

BACKGROUND

Agreements in PDF or non machine-readable format require manual processing to detect non-compliance, which is expensive & difficult to scale; lack of compliance in agreements expose organizations to various risks.

KEY FEATURES

  • ML, NLP & graph based culling of attributes from agreements
  • 45+ Pre-trained modules to digitize, validate attributes & fix errors
  • Reinforcement learning on “its own” and via a “human loop”
  • Prebuilt ontology for NDAs, SoWs, MSAs; fast creation of other ontologies
  • Search & compare documents; find trends & generate alerts

BACKGROUND

  • Duplicate payments, non-compliance & overpaid SoWs
  • Current reconciliation process is manual, difficult to scale & leads to errors
  • Companies lose average between 0.1% & 0.3% of all payments

KEY FEATURES

  • Culling of attributes using ML, NLP & graph based algorithms
  • 45+ pre-trained modules to digitize, validate attributes & fix errors
  • Determines duplicate & near-duplicate invoices
  • Reinforcement learning on “its own” and via a “human loop”
  • Prebuilt ontology for NDAs, SoWs, MSAs; fast creation of other ontologies

Key Differentiators & Business Benefits

anomaly-img

Anomaly Detection

Library of 25+ proprietary supervised & unsupervised algorithms, pre-trained & tested for our products

accuracy-img

Knowledge Graphs

30+ probabilistic spatial & temporal graph algorithms to determine links & connected entities

ontology-img

Domain Ontologies

Prebuilt financial ontologies & business rules that each product will improve over time

data-governance-img

Data Cleansing & Harmonization

60+ algorithms for cleansing, harmonizing & validating data from internal & external sources in 30+ formats, e.g., JPEG, video, PDFs

reinforcement-learning-img

External Data Enrichment

40+ scrapers & connectors for ingesting web & paid data subscriptions, e.g., news, social media, traffic, geo-location, sanction lists

Reinforcement-img

Reinforcement Learning

In-built reinforcement learning algorithms that improves our products’ performance over time “with & without a human loop”

interface-img

User Interface and APIs

Pre-built graphical user interface (GUI) & APIs for quick deployment & integration with clients’ workflows

deployment-img

Deployment & Scalability

Highly scalable with parallel & distributed computing with options to deploy on-premise or consume as SaaS

reverse-eng-img

Reverse Engineering

40+ proprietary ML & NLP algorithms to identify transformations & business rules and to fix manual & OCR errors

product-accuracy-img

Product Accuracy

91% – 98% accuracy that improves over time & with high quality ontology

time-cost-img

Time & Cost

60% – 85% improvement in time & cost over the current manual process

data-governance-img

Data Governance

Provides role-based access & sends alerts for exceptions to strategies, policies & compliance guidelines

For Demo & Additional Information

GET IN TOUCH