Data Scientist – NLP & Gen AI
Job Summary: We are seeking a highly skilled Data Scientist with strong expertise in Python, Natural Language Processing (NLP), and Generative AI. The ideal candidate will have hands-on experience building models with NLP techniques, working with large datasets, and applying Large Language Models (LLMs) to solve real-world business problems.
Key Responsibilities
- Design, develop, and deploy machine learning and NLP models for text-based applications.
- Work extensively with Python libraries such as Pandas, NumPy, and Scikit-learn for data processing and analysis.
- Implement and optimize NLP techniques using tools like NLTK, spaCy, or Transformers.
- Build and integrate solutions using Large Language Models (LLMs) and Generative AI frameworks.
- Perform data cleaning, preprocessing, and feature engineering on structured and unstructured datasets.
- Develop pipelines for model training, evaluation, and deployment.
- Collaborate with cross-functional teams to translate business requirements into AI-driven solutions.
- Stay updated with the latest advancements in AI, NLP, and Gen AI technologies.
Required Skills & Qualifications
- Strong programming experience in Python.
- Hands-on experience with NLP techniques.
- Proficiency in libraries such as NLTK, Pandas, NumPy, and Scikit-learn.
- Experience working with LLMs (e.g., GPT, BERT).
- Knowledge of Generative AI concepts and frameworks.
- Experience in data preprocessing, data analysis, and model building.
- Understanding of machine learning algorithms and model evaluation techniques.
- Strong problem-solving and analytical skills.
Preferred Qualifications
- Experience with Deep Learning frameworks like TensorFlow or PyTorch.
- Familiarity with Hugging Face Transformers or similar libraries.
- Experience in deploying models using APIs or cloud platforms (AWS, Azure, GCP).
- Knowledge of MLOps practices and model lifecycle management.
Education
- Bachelor’s or Master’s degree in Computer Science, Data Science, AI, or a related field.