Sr AI/ML Engineer - Hybrid in MN or DC - Remote elsewhere
Explicitly requires experience with AI-assisted or 'vibe coding' tools (e.g., Codex, Claude Code, Cursor, Windsurf).
About the Role
Senior AI/ML Engineer to design, build, and productionize generative AI and LLM solutions for enterprise healthcare use cases, focusing on RAG, agentic systems, and responsible AI. The role supports remote work across the U.S. with hybrid in Minneapolis or Washington, D.C.
Job Description
Role
Senior AI/ML Engineer responsible for designing and building production-grade generative AI, NLP, and large language model (LLM) solutions to support enterprise healthcare applications. Work includes LLM fine-tuning, retrieval-augmented generation (RAG), agentic systems, and implementing safety, monitoring, and governance for deployed systems.
Key Responsibilities
- Design and deploy transformer-based generative AI and NLP solutions for search, summarization, extraction, conversational AI, and decision support.
- Develop and fine-tune LLM applications using commercial and open-source models and APIs.
- Design and implement RAG pipelines: document ingestion, chunking, embedding generation, and vector database integration.
- Build agentic AI systems (single- and multi-agent) with tool use, reasoning, planning, and autonomous task execution.
- Implement evaluation and monitoring frameworks, including hallucination detection, bias monitoring, and human-in-the-loop evaluation.
- Develop safety and governance controls: prompt hardening, policy enforcement, and responsible AI guardrails.
- Productionize AI/ML pipelines with CI/CD, testing, monitoring, and observability in cloud-native environments.
- Collaborate with product, platform, and data science teams to translate requirements into scalable, reliable AI solutions.
Requirements
- 5+ years experience in machine learning, AI engineering, or applied data science with production ML systems.
- 5+ years building Python-based ML systems using frameworks such as PyTorch, TensorFlow, or Hugging Face.
- 3+ years building Generative AI or LLM applications, including prompt engineering, model fine-tuning, and API integrations.
- 2+ years designing RAG systems and vector search pipelines.
- 1+ year building agentic AI systems (tool-calling, planning frameworks, multi-agent workflows).
- 1+ year using AI-assisted development / “vibe coding” tools (examples: Codex, Claude Code, Cursor, Windsurf).
Preferred Qualifications
- Experience with LLM orchestration frameworks such as LangChain, LlamaIndex, or Semantic Kernel.
- Experience optimizing LLM latency, cost, and production performance.
- Experience deploying AI systems on AWS, Azure, or GCP.
- Experience working in regulated environments with responsible AI governance requirements.
Location & Remote Work
- Flexibility to work remotely from anywhere within the U.S.; employees hired in Minneapolis or Washington, D.C. are required to work in the office a minimum of four days per week.
- All remote employees must adhere to the company’s Telecommuter Policy.
Compensation & Benefits
- Salary range listed: $91,700 to $163,700 annually (based on full-time employment).
- Benefits include a comprehensive benefits package, incentive and recognition programs, equity stock purchase, and 401(k) contribution.