Saif Mahin
Vetted Pro
Level 2
Python Developer: AI Data Extraction and Web Scraping
Verificado pelo Fiverr Pro
Saif Mahin foi selecionado pela equipe do Fiverr Pro considerando sua experiência.
Verificado para
Data Scraping
Habilidades

Conheça meus serviços


Quer trabalhar com remuneração por hora?
Diga a Saif Mahin o que você precisa.
US$ 20
/
horaPortfólio
Experiência profissional
Python Developer
SupplyCopia • Período integral
Dec 2022 - Present • 3 yrs 5 mos
As a Python Developer at Supply Copia, I build scalable data pipelines, AI-powered document processing systems, and automation frameworks that handle large-scale unstructured data with high accuracy. What I've built and delivered: Document & Invoice Processing: Designed end-to-end invoice extraction pipelines processing 100K+ documents monthly, transforming unstructured PDFs into clean, structured datasets (Excel, CSV, Parquet). Built AI-assisted parsing using OpenAI APIs and LangChain to resolve field ambiguities and boost extraction accuracy. Created automated QA frameworks to catch mismatches in amounts, vendors, and invoice numbers at scale. AI & Intelligent Systems: Integrated embedding models and re-rankers (BGE) for schema mapping and intelligent column matching. Contributed to AI chatbot development, connecting LLMs with structured data and knowledge bases. Led automation initiatives using AWS Lambda, reducing manual effort and improving processing speed. Web Scraping & Automation: Engineered high-performance scraping systems with concurrency, retry logic, proxy rotation, and anti-bot strategies for large-scale data collection. Built and deployed REST APIs using FastAPI and Flask for internal tools and data workflows. Designed S3-based orchestration workflows for storing and processing structured outputs. Data Engineering & Analytics: Developed Snowflake-based data pipelines with monthly partitioned tables and consolidated reporting layers. Built data reconciliation systems using fuzzy matching (RapidFuzz), normalization, and rule-based + AI logic. Implemented parallel processing (ThreadPoolExecutor, batching, checkpointing) to handle thousands of vendors efficiently. I work closely with cross-functional teams to deliver reliable, production-ready solutions that drive data accuracy, automation, and business efficiency.
149 Avaliações
| (143) | ||
| (6) | ||
| (0) | ||
| (0) | ||
| (0) |
Classificação detalhada
- Nível de comunicação do freelancer
- Qualidade da entrega
- Valor da entrega
Ordenar por
martijnp17
Cliente recorrente

Holanda
Happy with the work Saif delivers! We've placed 18 orders at this moment of time.

garricklau

Estados Unidos
he took the time to understand exactly what I needed and produced and documentation that proved his skill
vindavis1

Austrália
Saif was really good. He knows what we are after. Great communication and we got what we promised.
p_dmdr
Cliente recorrente

Holanda
Excellent work, just like last year. Will come back next year.
leonardodurso

Azerbaijão

