
ADB


Setup Google Cloud account

Prerequisites

Before you can use Document AI, one of the many tools in Vertex AI, you need a Google Cloud Platform project, the Document AI API enabled, a document processor created, and service account credentials downloaded.

Instructions

  • Create a Google Cloud Platform project. To do this, go to the Google Cloud Console and click on the "Create project" button.

  • Enable the Document AI API. To do this, go to the "APIs & Services" page of the Google Cloud Console, search for Document AI, and click on the "Enable" button.

  • Create a document processor. To do this, go to the "Document processors" page of the Google Cloud Console and click on the "Create processor" button.

  • Download the service account credentials. To do this, go to the "API keys" page of the Google Cloud Console and click on the "Create key" button. Select "JSON key" and click on "Create".
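
If you prefer the command line, the first, second, and fourth steps can also be sketched with the gcloud CLI; the project ID and service-account name below are placeholders, and the processor itself still has to be created in the console:

gcloud projects create my-adb-project
gcloud services enable documentai.googleapis.com --project=my-adb-project
gcloud iam service-accounts create adb-sa --project=my-adb-project
gcloud iam service-accounts keys create google.json \
  --iam-account=adb-sa@my-adb-project.iam.gserviceaccount.com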

You can find more details and direct links for these tasks in the following tutorials:

Configuration

  • Once you have completed the prerequisites, you need to configure your project to use Document AI. This involves providing the project ID, location, processor name, and service account credentials path.

  • Provide the project ID: The project ID is the unique identifier for your Google Cloud Platform project. You can find the project ID in the "Project ID" field of the "APIs & Services" page in the Google Cloud Console.

  • Provide the location: The location is the region where your document processor is running. You can find the location in the "Location" field of the "Processors" page in the Google Cloud Console.

  • Provide the processor name: The processor name is the unique identifier for your document processor. You can find the processor name in the "Processor ID" field of the Processors page in the Google Cloud Console.

  • Provide the service account credentials path: The service account credentials path is the path to the JSON file that contains the service account credentials. You can download the service account credentials file from the "API keys" page in the Google Cloud Console.

You'll have to fill in this information in both config.json and google.json; the former must be filled in manually, while the latter is the JSON file downloaded from the Google Cloud Console.
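
As a rough sketch, config.json might look like the following. The field names under "google" are assumptions (check the file shipped with the repository); only the api_key field under openai is documented below:

{
  "google": {
    "project_id": "my-adb-project",
    "location": "eu",
    "processor_id": "abcdef123456",
    "credentials_path": "google.json"
  },
  "openai": {
    "api_key": "sk-..."
  }
}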

Create OpenAI account

  • Create an account on the OpenAI website
  • Create an API key in the API keys section of your account
  • Copy the API key and paste it into "src/config.json", in the api_key field under openai

Manage the funds of your account in the billing section; a minimum of $5 is required to use the API.
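
A quick way to check that the key works, assuming the openai Python package (v1+) is installed:

from openai import OpenAI

client = OpenAI(api_key="sk-...")  # the key you pasted into src/config.json

# Listing the available models is a cheap call that fails fast on a bad key
print(client.models.list().data[0].id)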

Init project

Clone the project and install the dependencies (Python version 3.10)

git clone https://github.com/Boukebya/ADB.git
cd ADB
pip install -r requirements.txt

If an error like no module named "pandas" or "flask" occurs, try the following commands:

pip install pandas
pip install flask

After that, you should be able to run the following command without any errors:

python src/api.py

If the API is running, you can call it via its path: http://127.0.0.1:5000/use_vertex/${path_img}
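
For example, with the server running locally (the image path below is a placeholder):

curl "http://127.0.0.1:5000/use_vertex/supply_list.png"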

OCR is the process of converting images of text into machine-encoded text. To convert our supply-list image into text, we use Google's OCR API, Google Cloud Vision, a Google Cloud service. It is important to update the config file with your own Google Vision API key. When using the API, the image is transferred to Google Cloud, and the result is returned as JSON and saved in "ocr.txt".
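
A minimal sketch of that OCR call with the google-cloud-vision client; it assumes GOOGLE_APPLICATION_CREDENTIALS points at your google.json, and the image file name is a placeholder:

from google.cloud import vision

client = vision.ImageAnnotatorClient()

# Load the supply-list image as raw bytes
with open("supply_list.png", "rb") as f:
    image = vision.Image(content=f.read())

response = client.text_detection(image=image)
print(response.text_annotations[0].description)  # the full detected text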

Extraction is the process of pulling structured information out of the text: the name of the product, the quantity, and any other useful details such as weight or dimensions. For this we use the OpenAI API, a cloud service that uses machine learning to extract information from text. We use the GPT-3.5 model, which is trained on a large text corpus and can extract structured information from free text. The result of the extraction is returned as JSON in "ocr.txt". We can also pass "classe", which corresponds to the school level of the supplies we want to extract. The extraction output contains the following fields: "name", the full text of the information we found with all its details; "article", the name of the product; and "quantity", the quantity of the product.
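
A hedged sketch of such an extraction call with the openai package; the prompt wording is illustrative, not the project's actual prompt:

from openai import OpenAI

client = OpenAI(api_key="sk-...")  # key from src/config.json

with open("ocr.txt") as f:
    ocr_text = f.read()

# Ask GPT-3.5 for the fields described above: name, article, quantity
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system",
         "content": "Extract each school supply from the text as JSON "
                    "with the fields name, article and quantity."},
        {"role": "user", "content": ocr_text},
    ],
)
print(response.choices[0].message.content)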

Matching takes the information extracted from the text and matches it against the database, "annuaire.json". This database can be modified and contains "texte" and "reference" fields. We compare the extraction output with every product in the database and return the product with the highest score. The score is computed with a similarity function, the Levenshtein distance, which measures the edit distance between two strings, combined with custom methods that award points based on how similar the words of the two strings are. Some custom methods are also used to determine the right product in special cases, for example for books or paper.
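
A small sketch of that scoring idea, assuming annuaire.json is a list of objects with "texte" and "reference" fields; the custom point-based methods are omitted:

import json

def levenshtein(a: str, b: str) -> int:
    # Classic dynamic-programming edit distance between two strings
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def best_match(extracted: str, annuaire_path: str = "annuaire.json") -> dict:
    # Return the product whose "texte" is closest to the extracted string
    with open(annuaire_path) as f:
        products = json.load(f)
    return min(products, key=lambda p: levenshtein(extracted, p["texte"]))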

Here is a summary of the project's results:

Some tools are also available to customize the project:

  • An interface to modify the database
  • A method to preprocess a database
