WEB
app
personal

CVClassificationSystem

Advanced CV classification and analysis system using multiple OCR extractors, NLP with spaCy/transformers, ML with scikit-learn, and local LLM via Ollama.

100% solo developer, Multi-OCR + ML + LLM pipeline

100.0% contribution

Quick Stats

Period

Sep 2025

Commits

1 / 1

My commits / Total

Team Size

0

Sole Developer

Contribution

100%

Sole Developer

Project Metrics

0%Contribution
My Commits
1 / 1

Period

Sep 2025

Role

Sole Developer

Team

1 person

90%+

Text Extraction

85%+

Skill Precision

GDPR

Privacy

$0

API Cost

Tech Stack

Python 3.8+
Tesseract
EasyOCR
Doctr
spaCy
NLTK
transformers
scikit-learn
Ollama
Flask
SQLite

Key Features

Multi-Extractor OCR

Tesseract, EasyOCR, Doctr with voting system for best results

NLP Pipeline

spaCy and transformers for named entity recognition

ML Classification

scikit-learn for skill categorization with confidence scoring

LLM Enhancement

Ollama local for contextual analysis with privacy

Geolocation

Automatic location detection from phone numbers

Web Dashboard

Flask UI for visualization and management

My Contribution

0.0%Contribution

Role

Sole Developer

Key Contributions

  • Multi-stage pipeline architecture
  • 4 OCR extractor implementations
  • spaCy and transformers integration
  • ML scoring and classification system
  • Flask web dashboard

Achievements

90%+

Text Extraction

85%+

Skill Precision

GDPR

Privacy

$0

API Cost

Challenges Solved