About

👋 Hello! I'm Lakshmi Priya Ramisetty, an experienced and certified Data and Machine Learning Engineer with a robust background in both data engineering and machine learning, and a specialized focus in Generative AI. I hold a Bachelor’s degree in Computer Science and a Master’s in Artificial Intelligence, which together provide a solid academic foundation that I leverage to tackle complex, real-world challenges in AI and data-driven solutions.

đź’ľ In Data Engineering, I excel at designing and implementing resilient data pipelines on Google Cloud Platform (GCP), crafting end-to-end data architectures that support high-performance AI applications. My data engineering expertise spans data ingestion, ETL processes, and pipeline optimization, where I use tools like Dataflow, BigQuery, Apache Beam, and Airflow to manage, automate, and scale data workflows. I have a strong command of SQL and Python, ensuring efficient data integration and processing within complex systems.

🛠️ As an ML Engineer, my work in large language models (LLMs) reflects a comprehensive understanding of their architecture and deployment. I specialize in both fine-tuning existing LLMs and building models from scratch to fit unique application needs. With expertise in prompt engineering, model optimization, and advanced inference techniques like model distillation and quantization, I create scalable, high-performance AI solutions that are ready for production. My work with CI/CD integration further enables seamless model deployment and continuous iteration, ensuring models remain relevant and effective over time.

🚀 I am passionate about pushing the boundaries of what AI can achieve and am always excited to connect with professionals and innovators in the field. Let’s connect, share insights, and explore opportunities to create impactful AI-driven solutions together!

  • Birthday: 14 August 1998
  • Phone: +1 551-689-1876
  • City: Redwood City, CA
  • Email: lakshmipriya.ramisetty@gmail.com

Interests

Data Engineering

Machine Learning

Computer Vision

Natural Language Processing

Deep Learning

Visualization

Algorithms

Generative AI

Education

MS in Artificial Intelligence

Aug 2023 - Dec 2024
Relevant Coursework
  • Neural Networks and Deep Learning
  • Natural Language Processing
  • Machine Learning
  • Artificial Intelligence
  • Numerical Methods

Executive Post Graduate Programme in Data Science (Major in Data Engineering)

Jul 2021 - Aug 2022
Relevant Coursework
  • Data Engineering
  • Machine Learning
  • Data Toolkit

Integrated MTech in Computer Science (Major in Data Science)

Aug 2015 - Jun 2020
Relevant Coursework
  • Database Management Systems
  • Data Structures & Algorithms
  • Programming Languages

Certifications

Google Cloud Certified Professional Machine Learning Engineer

Google Cloud Certified Professional Data Engineer

Google Cloud Certified Associate Cloud Engineer

John Hopkins University - Practical Machine Learning

John Hopkins University - Statistical Inference

John Hopkins University - R Programming

Experience

Data Engineer - GenAI

InTheLoop INC

Oct 2024 - Present

  • Fine-tuned Vision-Language Models (VLMs), such as Qwen2-VL and Gemini 1.5 Pro, using Supervised Fine-Tuning (SFT) techniques to create a fabric damage detection system and integrated dynamic pricing strategies to adjust item pricing based on their damage levels.
  • Optimized Qwen2-VL by modifying DeepSpeed’s pipeline to support Vision-Language Models (VLMs), improving inference speed by 2.5x.
  • Accelerated ML model deployment and lifecycle management using Vertex AI to automate model training, fine-tuning, and deployment.
  • Automated GCP infrastructure provisioning using Terraform to manage and deploy resources for data engineering and ML workflows.
  • Designed a scalable pipeline using Apache Beam, Dataflow, to scrape and download over 125,000 images from the web, streamlining large-scale data collection with a microservices architecture
  • Leveraged DINO v2 and KMeans clustering to group images based on feature embeddings, followed by background removal
  • Utilized Gemini 1.5 Pro for good and bad labelling classification to enhance background removal accuracy, refining dataset quality through prompt-based fine-tuning

Data, Machine Learning Engineer Volunteer

PopStock Educational Services

Aug 2024 - Dec 2024

  • Built a serverless RAG system on Google Cloud using BigQuery, Gemini, and Cloud Functions to deliver real-time, context-aware stock market insights while reducing operational costs by 60% through efficient resource utilization and serverless architecture.
  • Launched a simulated stock market platform pilot program, successfully introducing it to 5 schools in New York to enhance financial literacy and hands-on learning for students.
  • Automated JSON-based textual script generation for animations using Gemini 1.5 Pro, enabling seamless integration with Unreal Engine.
  • Collected and preprocessed over 100 million records of historical stock data for Fortune 100 companies, ensuring data integrity through advanced feature engineering and normalization techniques.
  • Designed and deployed a complex multi-layer LSTM model, improving predictive accuracy by 30% through detailed hyperparameter tuning and regularization.

AI/ML Engineer Intern

FinServ Experts

May 2024 - Aug 2024

  • Fine-tuned state-of-the-art Large Language Models (LLMs) like Llama2 and Mistral using parameter-efficient techniques such as LoRA and QLoRA, achieving highly accurate and contextually relevant responses for financial applications.
  • Created custom, domain-specific datasets tailored to financial use cases, including investment analysis, risk assessment, and regulatory compliance, optimizing model performance and reducing false positive rates by 35% in critical financial workflows.

Data Engineer

Egen

Mar 2021 - Jul 2023

  • Developed several ETL pipelines leveraging Dataflow, Pub/Sub, Dataproc and BigQuery, processing and harmonizing large datasets 50% faster than the industry standard.
  • Led the development of a comprehensive end-to-end healthcare demonstration using 7+ GCP tools, including BigQuery, BigQuery ML, Vertex AI, and Dialogflow, showcased at Google’s headquarters, illustrating advanced data analytics and AI capabilities.
  • Migrated 8 dashboards from Tableau to Looker, creating LookML models from scratch that significantly enhanced data visualization.
  • Established standards for hosting, CI/CD pipelines, Docker containerization, and Kubernetes orchestration. Implemented secure networking with GCP VPC, subnets, firewall rules, and IAM role-based access controls to enhance efficiency and security
  • Achieved a 70% reduction in pipeline execution time by implementing a decision tree harmonization strategy, optimizing data processing workflows.
  • Ingested and processed 10 TB of healthcare data comprising over 400 million records in BigQuery for high-volume analysis.
  • Optimized SQL query performance on large datasets using database partitioning, reducing execution time from 10 minutes to 2 minutes.
  • Utilized Google Cloud Workflows to orchestrate complex ETL pipelines, incorporating robust error handling and automated retry mechanisms to ensure reliability and achieving an impressive average uptime of 99.9%.

Machine Learning Engineer Intern

Accenture

Jan 2020 - Jul 2020

  • Automated document processing using Tesseract OCR, reducing processing time to 2 seconds per document and enhancing data extraction accuracy by 30%.
  • Conducted A/B tests to evaluate the effectiveness of different OCR algorithms, resulting in a 25% improvement in data extraction accuracy and efficiency.
  • Researched reinforcement learning techniques, including Q-Learning, DQN, and PPO, for Collaborative Robots (CoBots) to enhance adaptive behavior and interaction, supported by a comprehensive literature review and detailed software bot taxonomy.

Projects

  • All
  • Data Engineering
  • Data Science

Information Retrieval

Video Generation with Diffusion Models
Blue Shield of California - Data Ingestion and Visualization

Provider Directory Mapping

Data Capture and Analysis of Cab Rides

Machine Learning A-Z

Hands-On Machine Learning

Hands-On Machine Learning

RainFall Prediction

Toxic Comment classification

Blog

Skills

Languages & Databases

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Libraries & Frameworks

vectorlogo.zone vectorlogo.zone upload.wikimedia.org vectorlogo.zone vectorlogo.zone vectorlogo.zone upload.wikimedia.org

Cloud Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Big Data, ETL & APIs

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Tools

vectorlogo.zone vectorlogo.zone

Contact

My Address

San Mateo, CA 94403

Social Profiles

Email

lakshmipriya.ramisetty@gmail.com

laasya71314@gmail.com

Contact

+1 551-689-1876