CV

Basics

Name Amirsina Torfi
Label Machine Learning Engineer
Email amirsina{dot}torfi{at}gmail{dot}com
Url https://astorfi.github.io
Summary Accomplished Machine Learning Engineer specializing in Generative AI, NLP, and Computer Vision with over 8 years of experience in the field, including roles at Meta, Nike, and Huma AI.

Work

  • 2024.05 - Present
    Lead Machine Learning Engineer - Tech Lead
    Nike
    Directing the development of advanced machine learning and generative AI systems to revolutionize personalized consumer experiences and design process. Focused on integrating AI into Nike's design, testing, and consumer engagement initiatives to drive innovation and optimize operational efficiency.
    • Leveraged generative AI for enhanced image search, enabling precise and visually-driven product discovery.
    • Designed a recommendation system integrating vision models and Large Language Models (LLMs) to deliver highly personalized design suggestions.
    • Implemented collaborative filtering techniques in recommendation systems, improving accuracy and relevance for user-specific product recommendations.
    • Applied generative AI to enhance semantic segmentation and feature learning, boosting model performance in understanding complex visual data.
  • 2022.05 - 2024.05
    Machine Learning Research Scientist
    Meta (Facebook)
    Led cutting-edge AI research and development, focusing on the integration of NLP, Computer Vision, and Generative AI to drive advancements in consumer engagement and content personalization. Directed end-to-end initiatives from conceptualization through deployment, consistently enhancing computational efficiency and accuracy across Meta's digital platforms.
    • Enhanced semantic segmentation accuracy by 7% and boosted user engagement by 15% using advanced Vision Transformer (ViT) and SWIN Transformer models.
    • Deployed Retrieval-Augmented Generation (RAG) systems with Large Language Models (LLMs) to improve query relevance and user satisfaction.
    • Tripled computational efficiency with mixed-precision training and model quantization, reducing costs and increasing revenue by 33%.
    • Developed cross-modal AI systems integrating NLP and Computer Vision, advancing capabilities in Visual Question Answering (VQA) and Image Captioning.
    • Led deployment of generative AI models for personalized content, using diffusion models and transformers to elevate user experience on Meta platforms.
    • Optimized model deployment with TorchScript and ONNX for low-latency, real-time performance across diverse hardware.
    • Built large-scale data pipelines with SQL and BigQuery, improving data-driven decision-making and model training efficiency.
  • 2020.06 - 2022.05
    Senior Data Scientist
    Huma AI
    Specialized in developing and optimizing automated Q&A systems powered by large language models (LLMs) to drive revenue growth and operational efficiency. Led initiatives that leveraged cutting-edge NLP techniques to deliver accurate, responsive answers and streamline workflows, contributing to substantial cost savings and enhanced user experience.
    • Boosted answer accuracy by 20% and reduced response latency by 40% using zero-shot learning and clustering, enhancing system responsiveness and accuracy.
    • Optimized end-to-end data flow for Q&A, cutting manual labeling costs by 70% with automation, leading to significant cost savings.
    • Integrated AWS services, including SageMaker, for scalable Q&A model deployment, ensuring high performance and reliability in production.
    • Generated over $1M in annual revenue by improving model accuracy and system reliability, making the Q&A system a key revenue asset.
    • Built robust NLP pipelines using frameworks like Hugging Face and LangChain, enhancing model training efficiency and system responsiveness.
  • 2018.01 - 2020.08
    Graduate Research Assistant
    Virginia Tech
    Conducted research on privacy-preserving techniques for deep learning, focusing on federated learning in NLP and Computer Vision.
    • Developed privacy enhancement algorithms for large language models.
  • 2016.01 - 2017.12
    Graduate Research Assistant
    West Virginia University
    Researched the integration of computer vision and speech technologies for multimodal interaction systems.
    • Engineered a lip-reading system to enhance communication aids for the hearing-impaired.

Education

  • 2018.01 - 2020.08

    Blacksburg, Virginia

    PhD
    Virginia Polytechnic Institute and State University (Virginia Tech)
    Computer Science
    • Thesis: Synthetic data generation for privacy-preserving deep learning in medical applications.

Awards

  • 2020.08
    Best Ph.D. Thesis Award
    Virginia Tech
    Awarded for significant contributions to generative AI-driven synthetic data generation for healthcare.

Skills

Machine Learning & AI
Generative AI
Transformers (BERT, GPT-3, T5)
Computer Vision
NLP (Natural Language Processing)
Sequence-to-Sequence Models
Self-Supervised Learning
Zero-Shot Learning
Transfer Learning
Model Optimization & Compression (Quantization, Mixed-Precision)
Federated Learning
Privacy-Preserving ML
Domain Applications
Medical Imaging
Sports Analytics
AR/VR Applications
E-commerce Personalization
Digital Twins
Human Pose Estimation
Multimodal Interaction Systems
Predictive Modeling
Cluster Analysis
Frameworks & Tools
TensorFlow
PyTorch
Keras
Hugging Face
LangChain
LlamaIndex
SQL
BigQuery
Docker
AWS SageMaker
Vertex AI (GCP)
TorchScript
ONNX
MLflow
Weights & Biases
Optuna
Data Processing & Analysis
Data Preprocessing (Normalization, Scaling, Augmentation)
Model Evaluation Metrics (AUC-ROC, F1-score, mIoU)
Advanced Statistical Analysis
Experiment Tracking
A/B Testing
K-Fold Cross-Validation
Hyperparameter Tuning
ETL Pipelines
Data Warehousing (Snowflake, Redshift)
Programming & Systems
Python
Shell Scripting
SQL & BigQuery
CI/CD
PostgreSQL
DynamoDB
Elasticsearch
Version Control (Git, Mercurial)
React (Full-Stack Development)
Cloud & Distributed Computing
AWS (Lambda, ECR, SageMaker)
GCP (Vertex AI)
Apache Spark
Hadoop
Apache Kafka
Distributed Training
Visualization & Reporting
Seaborn
Matplotlib
Plotly
Tableau
Power BI
Dashboard Development
Interactive Visualizations
Project Management & Collaboration
Agile Methodologies
Team Leadership & Mentoring
Cross-Functional Collaboration
Slack
JIRA
Confluence
Data Governance
Strategic Planning
Stakeholder Communication

Languages

English
Fluent
Persian
Native

Interests

Machine Learning & Data Science
AI Ethics
Data Privacy
Federated Learning
Synthetic Data Generation
Deep Learning Trends
Explainable AI (XAI)
Bias & Fairness in AI
Edge AI
Human-AI Collaboration
Self-Supervised Learning
Reinforcement Learning Applications
Applications of AI
Medical Imaging
Augmented & Virtual Reality (AR/VR)
Personalized User Experiences
Digital Twins
Smart Retail
Environmental Sustainability through AI
Emerging Technologies
Quantum Computing for AI
Neurosymbolic AI
Neuromorphic Computing
Graph Neural Networks
Blockchain for Data Security

Projects

  • Open Source Projects on GitHub
    A collection of machine learning, computer vision, and NLP projects.
    • Applications in Generative AI
    • Multi-modal AI
    • Optimization for real-time systems