Staff Machine Learning Engineer

United States of America

Full Time

4 hours ago

$154,000 - $226,000 USD

No H1B

Key skills

Machine LearningDeep LearningGenerative AIReinforcement LearningRecommendation SystemsTransformersFine-tuned Large Language ModelsCausal InferenceRegressionAgentic AIMulti-agent SystemsLarge Language ModelsLangChainLlamaIndexRetrieval Augmented GenerationHybrid RetrievalRerankingQuery ExpansionContextual CompressionParameter-efficient Fine-tuningLoRAQLoRAPEFTModel QuantizationQuantization-aware Fine-tuningChain-of-Thought PromptingTree-of-Thought PromptingGraph-of-Thought PromptingML/LLM OperationsPyTorchTensorFlowHugging Face TransformersConversational AIMulti-modal LLM ModelsData MiningAdvanced StatisticsKubeflowDVCRaygRPCGraphQLDatabricks MLflowDatabricks Model ServingApache AirflowSparkFlinkKafkaKinesisSnowflakeEngineering ExcellenceOne TeamAIMLGenAILLMRAGAgenticXGBoostMLflowDatabricksAirflowMentoring

About this role

Credit Acceptance is an award-winning company recognized for its exceptional workplace culture. They are seeking a highly motivated and experienced Staff Machine Learning Engineer to lead the development of AI-powered solutions, collaborating with business and engineering stakeholders to achieve strategic goals and deliver innovative solutions.

Responsibilities:

Explore and apply advanced machine learning techniques, including not limited to large language models (LLMs), deep learning, and graph neural networks, to solve complex challenges across the organization
Collaborate with management and stakeholders to define strategic roadmaps and translate them into actionable quarterly plans
Drive execution and delivery of ML/AI solutions by managing priorities, deadlines, and deliverables, leveraging your technical expertise
Design and deliver scalable, secure systems using state-of-the-art AI/ML technologies and industry best practices, and nurture the culture of creating high-quality, well-tested systems to address critical product and business needs
Troubleshoot and resolve complex technical issues to improve system reliability, scalability, and operational efficiency
Ensure the security, scalability, and architectural integrity of feature designs through reviews across teams
Deliver hands-on solutions while mentoring other data professionals (including MLEs) within the organization
Guide a team of MLEs across different areas:
Mentor team members on design principles, coding standards, and the adoption of AI productivity tools
Personalize guidance across different surfaces using deep learning methods; personalize layouts with Bayesian contextual multi-armed bandits
Foster long-term growth through data-driven causality and incrementality
Power existing applications with Gen AI models and engineering to improve downstream experience and decisions
Using ML models (such as XGBoost & Causal Meta-Learner-based model, etc), proactively guide business teams across different areas
With engineering partners, build ML and Gen-AI platform and inference pipelines for different types of models
Architect and implement enterprise-grade LLM-powered solutions, managing the full lifecycle from business requirements to production deployment, monitoring, and continuous optimization
Design and develop multi-agent GenAI systems using state-of-the-art frameworks (LangChain, LlamaIndex) to orchestrate complex workflows across retrieval augmentation, data operations, and compliance verification
Engineer robust Retrieval Augmented Generation (RAG) pipelines incorporating advanced techniques such as hybrid retrieval, reranking, query expansion, and contextual compression
Implement parameter-efficient fine-tuning strategies (LoRA, QLoRA, PEFT) to adapt foundation models to domain-specific use cases while optimizing for inference costs and latency
Develop intelligent routing and orchestration systems to manage conversation state across multiple specialized AI agents, ensuring seamless transitions between different system capabilities
Build evaluation frameworks to measure and improve LLM performance across diverse metrics, including factuality, coherence, task completion, and alignment with business objectives
Integrate LLM solutions with existing enterprise architecture, ensuring compliance with data security policies, authentication mechanisms, and transaction safety requirements

Requirements:

PhD in Computer Science, Stats, Economics, or a relevant technical field with at least 5+ years of relevant experience or MS with at least 8+ years of experience in machine learning and software engineering
ML Skills: 6+ years of hands-on experience designing, building and deploying AI (ML, DL, Gen-AI) models, including Reinforcement Learning algorithms, Recommendation systems, Transformers, fine-tuned LLMs, Causal Inference, Regressions, etc., with a solid understanding of mathematics, statistics, and engineering needed to build such infra
GenAI Skills: 4+ years of experience building and deploying AI/ML applications including Reinforcement algorithms, Recommendation systems, Generative AI etc. with solid understanding of mathematics, Computer Science, foundation concepts and engineering behind building AI applications and LLMs
Experience applying agentic AI to design and implement scalable multi-agent systems
Strong problem-solving skills with bias for action
Experience in the automotive industry, especially in building ML/AI systems while ensuring local and central regulations
Experience in model interpretability and responsible AI practices
Expertise in data science, advanced experimentation and visualization techniques
Experience in designing and implementing pipelines using DAGs (e.g., Kubeflow, DVC, Ray)
Ability to construct batch and streaming microservices exposed as gRPC and/or GraphQL endpoints
Experience with Databricks MLflow for ML lifecycle management and model versioning
Hands-on experience with Databricks Model Serving for production ML deployments
Demonstrable experience in parameter-efficient fine-tuning, model quantization, and quantization-aware fine-tuning of LLM models
Hands-on knowledge of Chain-of-Thoughts, Tree-of-Thoughts, Graph-of-Thoughts prompting strategies
Knowledge of multimodal AI (text, image, audio integration)
Proficiency with GenAI frameworks/tools and technologies such as Apache Airflow, Spark, Flink, Kafka/Kinesis, Snowflake, and Databricks