#вакансия #астана #оффлайн #workITkz #архитектор, #dsml, #python
Должность: ML/AI lead
Компания: MDiet
https://mdiet.org/
Город: Астана
Занятость: оффлайн
Оплата: $4000-$5000
MDiet is an early-stage HealthTech startup developing an AI-powered platform that helps reduce patient readmissions.The product targets the US healthcare market.
Описание вакансии:
Tech Stack:
- LangGraph + LangChain (Agentic AI)
- vLLM + Hugging Face
- PyTorch + PEFT (LoRA/QLoRA)
- Vector databases (pgvector, Qdrant, Weaviate, Chroma, Pinecone)
- Advanced RAG + Prompt Engineering + Embeddings
- FastAPI + PostgreSQL
- Docker + Kubernetes + async processing (Arq / Celery / Redis Streams)
- Apache Airflow (data pipelines)
- AWS Cloud (EC2, S3, SageMaker, Lambda, RDS, Bedrock)
- Structured output (Outlines / Guidance / Instructor)
- Evaluation & observability (Langfuse, RAGAS / DeepEval, Prometheus + Grafana)
- MLOps practices
Requirements:
- Strong hands-on experience with production LLM systems (RAG + agents + inference)
- Experience with fine-tuning open-source LLMs (LoRA/QLoRA/PEFT) with measurable results
- Experience building data pipelines and versioning for LLM/RAG workloads
- Strong skills in evaluation, hallucination reduction and structured output
- Production Python + FastAPI + async processing
- Experience with self-hosted inference (vLLM or similar) and GPU optimization
- English proficiency at C1 level (strong written and spoken communication skills)
- Bachelor’s degree (or higher) in Computer Science, Artificial Intelligence, Machine Learning, Data Science, or a related technical field
- 5+ years in ML/AI, with 1.5–2+ years in production LLM projects
- Experience with MLOps
Responsibilities:
- Design and build multi-step LLM agents and pipelines for medical data curation
- Develop production RAG systems with high grounding and citation quality
- Deploy and optimize self-hosted LLM inference (vLLM) with cost and throughput optimization
- Fine-tune open-source models on domain-specific medical data with measurable improvement
- Build data pipelines and versioning for documents, embeddings and training data
- Create evaluation frameworks, guardrails and hallucination control systems
Implement reliable structured output and data extraction
- Work closely with physicians and Platform Architect
Nice to have:
- Experience in medical / clinical NLP or regulated domains
- Cost modeling and optimization of LLM workloads
- MLOps and production fine-tuning pipelines
- Apache Airflow, Prometheus/Grafana, or AI Solution Architecture
- Publications in relevant fields
Контакты:
Telegram TG: @team_mdiet Email: team@mdiet.org
Whatsapp
Whatsapp
whatsapp
Языки
английский (C1 — Продвинутый)
Навыки
Apache Airflow
Arq
AWS
Bedrock
Celery
Chroma
DeepEval
Docker
EC2
FastAPI
Grafana
Guidance
Hugging Face
Instructor
Домены
AI
HealthTech
ML
ИИ навыки
Apache Airflow
Arq
AWS Cloud
Celery
Chroma
DeepEval
Docker
Embeddings
FastAPI
GPU optimization
Grafana
Guidance
Hugging Face
Instructor
ИИ домены
Artificial Intelligence
Healthcare
HealthTech
Machine Learning
Natural Language Processing
* Домены определены автоматически с помощью нейросети