
Prateek T

@prateek_715

Data Engineer

India
English, Hindi
About me
I am a Data Engineer with hands-on experience in PySpark, Kafka, Python, SQL, and the Hadoop ecosystem. Currently, I build large-scale data pipelines and ETL workflows at Infosys, focusing on medallion architecture and Spark optimization. I have a strong foundation in ML-powered data products and experience taking projects from EDA to deployed APIs....

Skills

Average response time: 1 hour

My services

Formulas and Macros
I will solve your Excel problems

Work experience

Infosys

Data Engineer

Infosys • Full-time

Sep 2025 - Present • 10 mos

Deployed on the Databricks platform; helped build production pipelines processing 2–9 GB of data daily (7–12 million rows). Designed schema transformations for the medallion architecture, engineered PySpark optimizations (partition pruning, shuffle hash joins, broadcast joins), and implemented data serialization tuning; these optimizations reduced job execution time by up to 20% in some pipelines. Led data quality validation, schema design improvements, and schema evolution to accommodate upstream data changes. Worked cross-functionally with the team lead and senior engineers on parallelism optimization strategies.