Sofi Altamsh

NLP Engineer

Bangalore, Karnataka

About Me

I'm a passionate Natural Language Processing (NLP) Engineer & AI Enthusiast, driven by the power of data and artificial intelligence. I specialize in building 🔥 data-driven solutions with expertise in Python, data analysis, NLP, LLMs, Deep Learning and cutting-edge machine learning techniques. My goal is to create or train impactful models that solve real-world problems and push the boundaries of AI.

Skills

Machine Learning

  • Regression
  • Clustering
  • PCA
  • Preprocessing
  • Hyperparameter Tuning

Deep Learning

  • CNN, RNN
  • GANs
  • Transformers ( RoBERTa, T5, GPT)
  • LoRA

MLOps

  • Streamlit
  • Flask
  • HDFS, AWS Lambda
  • Vector Databases (FAISS)
  • MySQL, MongoDB

NLP & LLMs

  • Text2Text
  • RAG, TF-IDF
  • Word2Vec, NLTK, NER
  • Text Summarization
  • Hugging Face, LangChain

Visualization

  • Power BI
  • Matplotlib
  • Metabase
  • Seaborn
  • Excel

Libraries & Frameworks:

  • OCR
  • OpenCV
  • SciPy, PyTorch
  • Pandas, NumPy
  • TensorFlow, Scikit-learn
Regression Clustering PCA Preprocessing Hyperparameter Tuning CNN RNN GANs Transformers (RoBERTa, T5, GPT) LoRA Streamlit Flask HDFS AWS Lambda Vector Databases (FAISS) MySQL MongoDB Text2Text RAG TF-IDF Word2Vec NLTK NER Text Summarization Hugging Face LangChain Power BI Matplotlib Metabase Seaborn Excel OCR OpenCV SciPy PyTorch Pandas NumPy TensorFlow Scikit-learn Regression Clustering PCA Preprocessing Hyperparameter Tuning CNN RNN GANs Transformers (RoBERTa, T5, GPT) LoRA Streamlit Flask HDFS AWS Lambda Vector Databases (FAISS) MySQL MongoDB Text2Text RAG TF-IDF Word2Vec NLTK NER Text Summarization Hugging Face LangChain Power BI Matplotlib Metabase Seaborn Excel OCR OpenCV SciPy PyTorch Pandas NumPy TensorFlow Scikit-learn

Experience

NLP Engineer

Masai, Bengaluru

July 2025 - Present

Natural Language Processing Engineer
  • Automating ticket resolution processes with LLMs using LangChain-based RAG pipelines and AI agent frameworks.
  • Reduced user query response time by 92% (to less than 3.2 seconds) by building an Answering System using RAG pipeline
  • Built and deployed ML/DL pipelines for internal analytics, helping teams quickly identify talent trends.
Machine Learning Engineer (8 Months)
  • Increased 23% hiring rate of Masai students by analysing their performance and developing a CTC Prediction Model
  • Trained ML and DL models then deployed them in production. Analyzed data trends and created visualizations on Metabase
Data Research Analyst (4 months)
  • Worked on Data pipeline, Top Hiring Companies Analysis project and delivered insights to increase placement rate by 15%.
Data Analyst Associate (6 months)
  • Analyzed job market data across LinkedIn, Naukri, and Instahyre to extract hiring trends.
  • Supported the data team in structuring and cleaning large-scale recruitment datasets.

Data Science Intern

1Stop.ai, Bengaluru, Karnataka

Jun 2022 - Nov 2022 (6 months)

  • Trained predictive models for house loan value with more than 90% accuracy
  • Enhanced performance using PCA, visualized clusters & created dashboards
  • Evaluated multiple regression models (Linear, XGBoost, Random Forest) to select the best-performing one
  • Automated model evaluation workflows using Python and Scikit-learn pipelines
  • Documented model assumptions, data schema, and experiment results to support reproducibility

Projects

Face Mask Detection

Real-Time Global Case Monitoring 🌍📊⏱️

  • Built a real-time system using TensorFlow and OpenCV achieving 94.6% accuracy trained with 1500+ images
  • Tech Stack: Python, TensorFlow, OpenCV, Keras, NumPy
  • Designed user-friendly frontend for monitoring virus spread

Fake News Detection

Using Multi-Modal Learning 📰🤖🔍

  • Built a hybrid model combining RoBERTa for text and CNNs for image inputs to detect fake news articles
  • Achieved 88.7% accuracy on a custom multimodal dataset of 10,000+ samples.
  • Tech Stack: Python, Transformers (Hugging Face), RoBERTa, CNN, PyTorch, NumPy

NLP Chatbot

Smart chatbot that understands user intent and responds conversationally 🤖💬

  • Built an NLP-powered chatbot using Python and Hugging Face Transformers
  • Integrated pre-trained language models (e.g., BERT, GPT) for context-aware responses
  • Used spaCy and NLTK for intent recognition and entity extraction

Education

B.tech

Goverment College of Engineering, Aurangabad

2023

HSC

Dr. Babasaheb Ambedkar College, Nagpur

2019

Certifications

Achievements

Contact Me

Get in Touch

sofialtamsh123@gmail.com

+91 9156704982

Bangalore, Karnataka

Send Me a Message