
🚀 AML Risk Assessment

📌 Table of Contents

  • 🎯 Introduction
  • 🎥 Demo
  • ⚙️ What It Does
  • 🛠️ How We Built It
  • 🚧 Challenges We Faced
  • 🏃 How to Run
  • 🏗️ Tech Stack
  • 👥 Team

🎯 Introduction

The objective of this project is to build an AI/ML-powered system that automates entity research, verification, and risk scoring. By combining Generative AI, multi-source transaction data analysis, and automated workflows, the system improves accuracy, reduces manual effort, and provides a robust risk-evaluation framework that gives analysts the evidence they need for informed decisions.

🎥 Demo

🔗 Live Demo
📹 Video Demo

🖼️ Screenshots

⚙️ What It Does

```mermaid
%%{
  init: {
    'theme': 'dark',
    'themeVariables': {
      'primaryColor': '#0078D7',
      'primaryTextColor': '#FFFFFF',
      'primaryBorderColor': '#5B9BD5',
      'lineColor': '#5B9BD5',
      'secondaryColor': '#2C3E50',
      'tertiaryColor': '#2980B9',
      'fontFamily': 'Arial, sans-serif'
    },
    'flowchart': {
      'curve': 'basis',
      'diagramPadding': 10
    }
  }
}%%

flowchart TB
    classDef mainNode fill:#0078D7,stroke:#5B9BD5,stroke-width:2px,color:white,font-weight:bold
    classDef enrichNode fill:#7030A0,stroke:#9b59b6,stroke-width:2px,color:white
    classDef sourceNode fill:#00B0F0,stroke:#00B0F0,stroke-width:2px,color:white
    classDef riskNode fill:#C00000,stroke:#C00000,stroke-width:2px,color:white
    classDef outputNode fill:#ED7D31,stroke:#ED7D31,stroke-width:2px,color:white

    %% Main workflow
    Transaction["📄 Transaction Data"]:::mainNode
    ExtractEntities["🔍 Extract Entities"]:::mainNode
    EnrichData["📊 Enrich Data"]:::enrichNode
    AssessRisk["⚖️ Risk Assessment"]:::riskNode
    StoreResults["💾 Store Results"]:::outputNode
    SendCallback["🔄 API Callback"]:::outputNode

    %% Flow connections
    Transaction --> ExtractEntities --> EnrichData --> AssessRisk --> StoreResults --> SendCallback

    %% Data sources
    subgraph DataSources["Intelligence Sources"]
        direction TB
        OpenCorp["🏛️ Corporate Registry"]:::sourceNode
        Sanctions["⚠️ Sanctions Lists"]:::sourceNode
        PEP["👑 PEP Database"]:::sourceNode
        Wikidata["🌐 Entity Networks"]:::sourceNode
        News["📰 Adverse Media"]:::sourceNode
    end

    %% Entity groups
    subgraph Entities["Entity Processing"]
        direction TB
        ProcessOrgs["🏢 Organizations"]:::enrichNode
        ProcessPeople["👤 People"]:::enrichNode
        ProcessDiscovered["🔎 Discovered Entities"]:::enrichNode
    end

    %% Storage
    subgraph Storage["Data Storage"]
        direction TB
        Neo4j["🕸️ Neo4j Graph DB"]:::outputNode
        KB["📚 Knowledge Base"]:::outputNode
    end

    %% Connect subgraphs
    ExtractEntities --> Entities
    Entities --> DataSources
    EnrichData --> DataSources
    AssessRisk --> Storage
    StoreResults --> Storage
```

The DAG workflow processes transactions through six key stages:

  1. Transaction Data - Receives transaction information from the API
  2. Extract Entities - Identifies organizations and people using Gemini AI
  3. Enrich Data - Gathers intelligence from multiple external sources
  4. Risk Assessment - Analyzes data to calculate risk scores with supporting evidence
  5. Store Results - Persists findings in structured storage systems
  6. API Callback - Notifies requesting systems when processing completes
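
To make the orchestration concrete, here is a minimal sketch of how these six stages could be wired together with Airflow's TaskFlow API. The task names and bodies are illustrative assumptions, not the repository's actual DAG.

```python
# Minimal illustrative sketch: task names and bodies are assumptions,
# not this repository's actual DAG definition.
from datetime import datetime
from airflow.decorators import dag, task

@dag(schedule=None, start_date=datetime(2024, 1, 1), catchup=False)
def aml_risk_assessment():
    @task
    def receive_transaction() -> dict:
        # Stage 1: transaction payload handed over by the API
        return {"transaction_id": "TEST-001", "text": "..."}

    @task
    def extract_entities(txn: dict) -> dict:
        # Stage 2: organizations and people identified with Gemini
        return {**txn, "entities": []}

    @task
    def enrich_data(txn: dict) -> dict:
        # Stage 3: sanctions, PEP, registry, Wikidata, adverse media
        return {**txn, "evidence": []}

    @task
    def assess_risk(txn: dict) -> dict:
        # Stage 4: risk score plus supporting evidence
        return {**txn, "risk_score": 0.0}

    @task
    def store_results(txn: dict) -> dict:
        # Stage 5: persist to Neo4j and the knowledge base
        return txn

    @task
    def send_callback(txn: dict) -> None:
        # Stage 6: notify the requesting system
        ...

    # Chaining the calls defines the linear six-stage dependency
    send_callback(store_results(assess_risk(enrich_data(
        extract_entities(receive_transaction())))))

aml_risk_assessment()
```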

The architecture integrates several components:

  • Entity Processing - Handles organizations, people, and discovered entities in parallel
  • Intelligence Sources - Corporate registries, sanctions lists, PEP databases, entity networks, and adverse media
  • Data Storage - Neo4j graph database for relationship analysis and organized knowledge base
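
On the storage side, relationship persistence could look like the following sketch using the official neo4j Python driver. The node labels, relationship type, credentials, and score value are assumptions for illustration, not the project's actual schema.

```python
# Sketch of persisting an entity relationship to Neo4j.
# Labels, property names, and credentials are illustrative assumptions.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

def link_entities(tx, txn_id, sender, receiver, score):
    # MERGE keeps entities unique across transactions
    tx.run(
        """
        MERGE (s:Organization {name: $sender})
        MERGE (r:Organization {name: $receiver})
        MERGE (s)-[t:TRANSACTED_WITH {transaction_id: $txn_id}]->(r)
        SET t.risk_score = $score
        """,
        sender=sender, receiver=receiver, txn_id=txn_id, score=score,
    )

with driver.session() as session:
    session.execute_write(link_entities, "TEST-SANC-002",
                          "European Trade Solutions GmbH",
                          "Sberbank of Russia", 0.85)
driver.close()
```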

πŸ› οΈ How We Built It

We utilized Apache Airflow for workflow orchestration, Neo4j for graph-based data modeling, and Gemini LLMs for AI-powered tasks like entity extraction and risk scoring. The frontend was built with React, Vite, and Mantine, while the backend used FastAPI for high performance and scalability.
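
As a flavor of the Gemini-powered extraction step, the sketch below asks the model to return entities as JSON. The model name, prompt, and response handling are assumptions for illustration, not the project's exact implementation.

```python
# Illustrative sketch of entity extraction with the Gemini API.
# Model name, prompt, and response handling are assumptions.
import json
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-1.5-flash")

def extract_entities(transaction_text: str) -> dict:
    prompt = (
        "Extract all organizations and people from this transaction. "
        'Reply with JSON only: {"organizations": [...], "people": [...]}\n\n'
        + transaction_text
    )
    response = model.generate_content(prompt)
    raw = response.text.strip()
    if raw.startswith("```"):
        # drop any markdown fencing the model wraps around the JSON
        raw = raw.strip("`").removeprefix("json").strip()
    return json.loads(raw)
```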

🚧 Challenges We Faced

Key challenges included integrating diverse technologies (Airflow, Neo4j, Gemini), optimizing data pipelines for low latency, and designing an intuitive frontend. Overcoming these required innovative problem-solving, effective collaboration, and rapid iteration.

πŸƒ How to Run

Prerequisites

  • Ensure you have Docker and Docker Compose installed.
  • For the frontend, ensure you have Node.js, npm, and yarn installed.

Steps to Run the Project

  1. Clone the repository

    git clone https://github.com/ewfx/aidel-tech-vi-kings
    cd aidel-tech-vi-kings
  2. Set up the environment variables
    Navigate to the code/src directory and create a .env file by copying the .env.example file. Update the variables in the .env file as needed:

    cd code/src
    cp .env.example .env
  3. Create the data/pep folder and copy the pep_data.csv file
    Still inside code/src, create a folder named pep under the data directory and copy the pep_data.csv file into it as root:

    sudo mkdir -p data/pep
    sudo cp /path/to/pep_data.csv data/pep/
  4. Set up the backend
    Start the backend services using Docker Compose:

    docker-compose up --build
  5. Set up the frontend

    • Navigate to the client directory.
    • Replace the API_URL in the constants.ts and transactionApi.ts files with the appropriate backend API URL (e.g., http://localhost:<backend-port>).
    • Install dependencies and start the development server:
    cd client
    # Replace API_URL in constants.ts and transactionApi.ts
    nano src/api/constants.ts
    nano src/api/transactionApi.ts
    # Install dependencies and start the server
    yarn
    yarn dev
  6. Access the application

    • The backend API will be available at http://localhost:8000.
    • The Airflow UI will be available at http://localhost:8080.
    • The Neo4j UI will be available at http://localhost:7474.
    • The frontend will be available at http://localhost:5173.

Steps to Run Tests

  1. Set up the testing environment
    Create a virtual environment and install the required dependencies:

    cd code/test
    make setup
  2. Run all tests
    Execute all test suites (BDD, unit, and API tests):

    make test-all
  3. Run specific tests

    • BDD tests:
      make bdd
    • Unit tests:
      make unit
    • API tests:
      make api
  4. Generate a coverage report
    Create a test coverage report:

    make report
  5. Clean up test artifacts
    Remove generated files and reports:

    make clean

BDD Testing

The system includes comprehensive behavior-driven development (BDD) tests to verify key functionality. These tests use the Behave framework to define scenarios in natural language that both technical and non-technical stakeholders can understand.

Test Categories

The BDD tests cover the following key risk assessment capabilities:

  1. Sanctions Detection: Identifying transactions involving sanctioned entities or countries
  2. PEP Detection: Recognizing Politically Exposed Persons (PEPs) and their connections
  3. Shell Company Detection: Identifying patterns consistent with shell companies
  4. Network Analysis: Analyzing relationships between entities in transactions
  5. Data Enrichment: Validating data retrieval from various sources
  6. Complex Risk Detection: Identifying multi-faceted risk scenarios
  7. Multi-Jurisdictional Risk: Assessing risks across multiple jurisdictions

Running BDD Tests

cd tests
make setup  # Set up virtual environment and install dependencies
make bdd    # Run BDD tests

Or run an individual feature file:

cd tests
./venv/bin/behave features/sanctions_detection.feature

Example BDD Test Case

Below is an example test scenario from our test suite that verifies the system's ability to detect a sanctioned organization:

Feature: Sanctions Detection
  As a financial compliance officer
  I want to identify transactions involving sanctioned entities
  So that I can block prohibited transactions

  Scenario: Detecting a sanctioned organization
    Given a transaction with the following content:
      """
      Transaction ID: TEST-SANC-002
      Date: 2023-09-21 14:30:00

      Sender:
      Name: European Trade Solutions GmbH
      Account: DE89 3704 0044 0532 0130 00 (Deutsche Bank)
      Address: Friedrichstrasse 123, Berlin, Germany

      Receiver:
      Name: Sberbank of Russia
      Account: RU12 3456 7890 1234 5678 9012
      Address: Moscow, Russia

      Amount: $750,000 USD
      Transaction Type: SWIFT Transfer
      Reference: Equipment Purchase Contract #ER-789

      Additional Notes:
      Transfer related to energy sector equipment
      """
    When I submit the transaction
    And I wait for the transaction to complete
    Then the transaction status should be "completed"
    And the risk score should be at least 0.8
    And the extracted entities should include:
      | European Trade Solutions GmbH |
      | Sberbank of Russia           |
    And the reasoning should include any of:
      | sanction         |
      | russia           |
      | restricted       |

This test verifies that:

  1. The system can process a transaction involving a sanctioned entity (Sberbank of Russia)
  2. The transaction is properly analyzed and completed
  3. The risk score meets the minimum threshold of 0.8
  4. The system correctly extracts the relevant entities
  5. The reasoning includes key terms related to sanctions
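
Each Given/When/Then line binds to a Python step definition. Below is a minimal sketch of what such steps could look like with Behave and requests; the endpoint paths, response fields, and polling logic are assumptions, not the repository's actual step code.

```python
# Sketch of Behave step definitions for the scenario above.
# Endpoint paths and response fields are assumptions.
import time
import requests
from behave import given, when, then

BASE_URL = "http://localhost:8000"  # assumed backend URL

@given("a transaction with the following content")
def given_transaction(context):
    # context.text carries the triple-quoted block from the feature file
    context.transaction_text = context.text

@when("I submit the transaction")
def submit_transaction(context):
    resp = requests.post(f"{BASE_URL}/transactions",  # hypothetical endpoint
                         json={"text": context.transaction_text})
    resp.raise_for_status()
    context.transaction_id = resp.json()["id"]

@when("I wait for the transaction to complete")
def wait_for_completion(context):
    for _ in range(60):  # poll for up to ~60 seconds
        resp = requests.get(f"{BASE_URL}/transactions/{context.transaction_id}")
        context.result = resp.json()
        if context.result.get("status") in ("completed", "failed"):
            return
        time.sleep(1)

@then('the transaction status should be "{status}"')
def check_status(context, status):
    assert context.result["status"] == status

@then("the risk score should be at least {score:f}")
def check_risk_score(context, score):
    assert context.result["risk_score"] >= score
```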

Test Data Validation

The tests also verify that the system correctly collects and processes data from multiple sources:

And the assessment data should include the transaction text
And the assessment data should include organization "European Trade Solutions GmbH"
And the assessment data should include organization "Sberbank of Russia"
And organization "Sberbank of Russia" should have data from "sanctions"
And organization "Sberbank of Russia" should have data from "wikidata"
And at least 1 sanctions results should be included in the assessment data

These validation steps ensure that:

  1. The original transaction text is preserved
  2. All organizations are correctly identified and stored
  3. Sanctions data is retrieved for sanctioned entities
  4. Additional enrichment data is collected from Wikidata
  5. The minimum expected number of sanctions results is found
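
These checks, too, map onto parameterized step definitions. A minimal sketch, assuming the completed transaction exposes a hypothetical assessment_data structure (not the repository's actual step code):

```python
# Sketch of assessment-data validation steps.
# The assessment_data layout is an assumption for illustration.
from behave import then

@then('the assessment data should include organization "{name}"')
def org_included(context, name):
    orgs = context.result["assessment_data"]["organizations"]
    assert any(o["name"] == name for o in orgs), f"{name} missing"

@then('organization "{name}" should have data from "{source}"')
def org_has_source(context, name, source):
    orgs = context.result["assessment_data"]["organizations"]
    org = next(o for o in orgs if o["name"] == name)
    assert source in org["data_sources"], f"no {source} data for {name}"
```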

πŸ—οΈ Tech Stack

  • πŸ”Ή Frontend: React / Vite / Mantine
  • πŸ”Ή Backend: FastAPI
  • πŸ”Ή Database: Postgres / Redis
  • πŸ”Ή Other: Gemini API / Neo4j / Airflow

👥 Team
