Lead the development and deployment of NLP-based solutions to process and analyze unstructured data at scale.
Design, train, and optimize machine learning models using libraries such as PyTorch, NLTK, and Scikit-learn.
Architect and deploy AI/ML products on cloud platforms like Azure, GCP, or AWS.
Collaborate with data engineering teams to ensure seamless integration of AI models into production systems.
Perform advanced SQL analytics to extract actionable insights from structured datasets.
Mentor junior data scientists and foster a culture of technical excellence within the team.
Partner with customers to understand their needs and translate them into technical solutions.
Requirements
Minimum 8 years of experience in data science, with a focus on NLP and unstructured data processing.
Proven track record of launching NLP-driven products to live users.
Expertise in Python and standard libraries such as PyTorch, NLTK, and Scikit-learn.
Experience with Transformer-based models (e.g., BERT, GPT).
Develop, train, and optimize ML and deep learning models (classification, regression, clustering, sequence modeling, embeddings).
Implement and fine-tune transformer-based models such as BERT, GPT-style LLMs, and domain-specific architectures.
Strong experience with one or more cloud platforms (Azure, GCP, AWS) for hosting and deploying AI/ML products.
Design and implement NLP pipelines for text classification, information extraction, topic modeling, semantic search, summarization, and conversational AI applications.
Familiarity with data engineering pipelines and best practices.
Proficiency in SQL for analytics and data manipulation.