Dr. Maya Thompson

Senior Data Scientist | Machine Learning Engineer

📧 maya.thompson@email.com | 📱 (555) 345-6789 | 🔗 linkedin.com/in/mayathompson | 💻 github.com/mayathompson | 📊 mayathompson.github.io

Boston, MA

Professional Summary

Ph.D. Data Scientist with 6+ years of experience building production ML systems and extracting actionable insights from complex datasets. Expert in deep learning, NLP, and statistical modeling with proven track record of deploying models that generated $30M+ in business value. Published researcher with 8 peer-reviewed papers and 200+ citations.

Technical Skills

Programming Languages: Python, R, SQL, Scala, Java

ML/AI Frameworks: PyTorch, TensorFlow, Scikit-learn, XGBoost, LightGBM, Hugging Face Transformers

NLP & Deep Learning: BERT, GPT, Transformers, LSTM, CNN, Attention Mechanisms, Transfer Learning

Data Engineering: Apache Spark, Airflow, Kafka, dbt, ETL/ELT Pipelines

Databases: PostgreSQL, MySQL, MongoDB, Snowflake, BigQuery, Redis

Cloud Platforms: AWS (SageMaker, EMR, S3, Lambda), GCP (Vertex AI, BigQuery), Azure ML

Tools & Libraries: Pandas, NumPy, SciPy, Matplotlib, Seaborn, Plotly, Jupyter, MLflow, Weights & Biases

Statistical Methods: A/B Testing, Hypothesis Testing, Bayesian Inference, Causal Inference, Time Series Analysis

MLOps: Docker, Kubernetes, CI/CD, Model Monitoring, Feature Stores, Model Versioning

Professional Experience

Senior Data Scientist | FinancialTech Corp | Boston, MA

January 2021 - Present

Built and deployed fraud detection system using ensemble methods (XGBoost + Neural Networks) that prevented $18M in fraudulent transactions annually while reducing false positives by 40%
Developed NLP-powered chatbot using fine-tuned BERT model, handling 70% of customer inquiries and reducing support costs by $2M/year
Created customer churn prediction model with 89% precision, enabling proactive retention campaigns that decreased churn by 15%
Led ML infrastructure modernization, implementing MLOps best practices with MLflow and Kubernetes, reducing model deployment time from weeks to hours
Managed team of 3 data scientists, conducting code reviews and mentoring on advanced ML techniques
Designed and ran 50+ A/B tests, optimizing product features and marketing campaigns with statistical rigor
Built real-time recommendation engine serving 1M+ users, increasing revenue per user by 25%

Key Technical Achievements:

Reduced model training time by 70% through distributed computing with Apache Spark
Implemented feature store that eliminated data inconsistencies and reduced engineering overhead by 50%
Built automated model monitoring system detecting data drift and performance degradation

Data Scientist | E-Commerce Platform | Cambridge, MA

June 2018 - December 2020

Developed dynamic pricing algorithm using reinforcement learning that increased profit margins by 12% while maintaining sales volume
Built computer vision model for product image classification with 95% accuracy, automating catalog management for 500K+ products
Created customer segmentation using unsupervised learning (K-means, DBSCAN), enabling personalized marketing campaigns that improved conversion by 28%
Implemented time series forecasting models (Prophet, ARIMA, LSTM) for inventory optimization, reducing stockouts by 35%
Built ETL pipelines processing 10TB+ of data daily using Apache Airflow and Spark
Collaborated with engineering teams to productionize 15+ ML models with 99.9% uptime
Conducted exploratory data analysis to identify $5M revenue opportunity in underserved customer segment

Research Data Scientist | MIT Media Lab | Cambridge, MA

September 2016 - May 2018

Conducted research on social network analysis using graph neural networks, resulting in 3 published papers
Developed novel NLP algorithms for sentiment analysis and topic modeling on social media data
Built predictive models for user behavior patterns with applications in public health and social good
Collaborated with interdisciplinary team of 10+ researchers across computer science, psychology, and sociology
Mentored 5 undergraduate researchers on data science projects and methodologies
Presented research findings at top-tier conferences (NeurIPS, ICML, ACL)

Education

Ph.D. in Computer Science (Machine Learning)

Massachusetts Institute of Technology (MIT) | Cambridge, MA
Graduated: May 2018
Dissertation: "Deep Learning Approaches for Social Network Analysis and Behavior Prediction"
Advisor: Prof. Sandra Chen

Master of Science in Statistics

Stanford University | Stanford, CA
Graduated: June 2014
GPA: 3.9/4.0

Bachelor of Science in Mathematics & Computer Science

University of California, Berkeley | Berkeley, CA
Graduated: May 2012
Summa Cum Laude, GPA: 3.95/4.0

Publications & Research

Thompson, M., et al. (2023). "Robust Fraud Detection Using Ensemble Deep Learning." ACM Transactions on Knowledge Discovery from Data. [DOI: 10.1145/xxxxx]
Thompson, M., & Chen, S. (2018). "Graph Neural Networks for Social Influence Prediction." Advances in Neural Information Processing Systems (NeurIPS). [Citations: 95]
Thompson, M., et al. (2017). "Transfer Learning for Low-Resource NLP Tasks." Association for Computational Linguistics (ACL). [Citations: 67]
Liu, J., Thompson, M., et al. (2019). "Interpretable Machine Learning for Healthcare Applications." Journal of Machine Learning Research. [Citations: 52]

Total Citations: 200+
h-index: 6
i10-index: 18

Projects & Open Source

ML Model Deployment Framework (GitHub: 2.1K stars)

Created open-source Python library for simplified ML model deployment
Used by 50+ companies for production ML workflows
Featured on PyData Conference and DataScience Weekly

Kaggle Competitions

Gold Medal (2x): NLP Competition (Top 1%), Image Classification (Top 2%)
Kaggle Master ranking (Top 1% of 150K+ users)
Shared competition solutions with 10K+ views

Personal Projects

Stock Market Prediction: Deep learning models using LSTM and attention mechanisms
Climate Data Analysis: Time series analysis of global temperature trends with interactive visualizations
Automated Research Paper Summarizer: Fine-tuned GPT model for scientific paper summaries

Awards & Honors

MIT Presidential Fellowship (2014-2018) - Full scholarship for doctoral studies
Best Paper Award - NeurIPS Workshop on Social Good (2017)
Outstanding Graduate Student Instructor - UC Berkeley (2013)
Grace Hopper Scholarship - Awarded for excellence in computing (2016)

Certifications

AWS Certified Machine Learning - Specialty (2022)
TensorFlow Developer Certificate - Google (2021)
Deep Learning Specialization - deeplearning.ai (Andrew Ng) (2020)
Advanced SQL for Data Scientists - Mode Analytics (2019)

Teaching & Mentorship

Guest Lecturer - MIT Course 6.867 (Machine Learning), Harvard CS109 (Data Science) (2020-Present)
Workshop Instructor - "Introduction to Deep Learning" at PyData Boston (2022, 2023)
Data Science Mentor - Mentored 15+ aspiring data scientists through Springboard and SharpestMinds
Technical Blog - Published 20+ tutorials on ML topics (50K+ views on Medium and personal blog)

Conference Presentations

"Production ML Systems at Scale" - MLOps World (2023)
"NLP in FinTech: Challenges and Solutions" - PyData Boston (2022)
"Interpretable AI for Business Applications" - Strata Data Conference (2021)

Additional Information

Professional Memberships: ACM, IEEE, Women in Machine Learning (WiML)
Languages: English (Native), French (Intermediate)
Interests: Ethical AI, interpretable ML, data privacy, mentoring women in STEM

This resume was created using markdown and converted to PDF at MarkdownResume.app

Data Scientist Resume Template