Software developer experienced in designing large-scale data platforms and machine learning infrastructure with Python, Apache Spark, and AWS. Proven track record of building distributed data pipelines, streaming systems, and scalable data lakes for very large datasets. Uses Spark, Kafka, Airflow, and EMR for batch and real-time data processing, and builds end-to-end ML pipelines that significantly reduce model training time and improve production performance. Skilled in architecting cloud-native data ecosystems and scalable analytics platforms that enable data-driven decision making.