К вакансиям
ML Engineer

Inference Engineer Middle/Senior Remote

ID: 26752
13 февраля 2026 г.
Активна
Glam AI

Формат работы

Удаленная работа

📞Способы связи

📄 Оригинальный текст вакансии

#vacancy #job #remote #inference_engineer #middle #senior #вакансия Inference Engineer @ Glam AI Why Join Us? - Collaborate with a powerhouse team of marketing professionals from top industry players like Lensa, Picsart, Viber, AIRI, Yandex. - Benefit from the guidance of investors with a history of successful exits, including the sale of Looksery and AI Factory to Snap for $150M and $166M, respectively. - Be part of a rapidly growing company with $50M ARR and 250K+ happy customers across the US and Europe. - Engage in innovative AI-driven projects in a dynamic and fast-paced startup environment. ### About the Role As an Inference Engineer you will be responsible for optimizing neural networks in real production environments — from profiling and performance analysis to leveraging existing solutions and implementing custom ones when needed. If you've actually made models faster and enjoy working at the intersection of high-level ML and low-level GPU performance, we'd love to talk. ### Key Responsibilities - Profile, benchmark, and identify performance bottlenecks in neural network inference pipelines - Port, adapt and optimize models for on-device inference (latency, memory, battery, thermal stability) - Optimize server-side inference for throughput and cost efficiency - Collaborate with ML researchers to co-design model architectures with inference efficiency in mind ### Qualifications 1. Experience: - Experience in deep learning inference optimization (mobile or edge) - Hands-on with at least one of: - Core ML / TFLite / ONNX Runtime / TensorRT - Metal / Vulkan / OpenCL / OpenGL / CUDA / Triton 2. Technical Skills: - Strong understanding of GPU/NPU architecture and execution model - Solid grasp of inference optimization techniques: quantization, operator fusion, graph optimization ### Benefits - Competitive salary and leadership growth opportunities. - Opportunity to work on innovative AI-based applications. - Supportive, fast-paced startup environment with a strong engineering team. - All necessary equipment provided. Для отклика присылайте резюме в телеграм @foreverinlovewithsummer

🛠 Навыки

Core ML
CUDA
Metal
ONNX Runtime
OpenCL
OpenGL
TensorRT
TFLite
Triton
Vulkan

🎯 Домены

AI
ML
SaaS

🤖 ИИ навыки

architecture regulations
artificial neural networks
Computer Vision
Deep Learning
design thermal requirements
IBM WebSphere
ICT performance analysis methods
maintain core parts
metal and metal ore products
periodisation
profile people
quantum mechanics
state estimation
supercomputing
train medical staff on nutrition

* Навыки определены автоматически с помощью нейросети

🤖 ИИ домены

Artificial Intelligence
Computer Vision
Edge computing
High-Performance Computing
Machine Learning
Mobile computing
Software optimization

* Домены определены автоматически с помощью нейросети

📢 Информация о публикации

Канал:belit_jobs