Bonfy.AI is building the trust layer for generative AI. Our Adaptive Content Security platform detects and mitigates subtle risks embedded in large language model (LLM) outputs before they reach users. From hallucinations to hidden data leaks, we enable enterprises to deploy GenAI confidently, without compromising truth, privacy, or reputation.

We are model-agnostic, outcome-driven, and unapologetically rigorous. Our customers include leading Fortune 500 teams working in high-stakes sectors where trust is not optional.

Why This Role Matters

We need an MLOps Engineer to optimize our GPU-accelerated ML infrastructure for performance and cost efficiency. Working with our existing Sr. DevOps and Sr. SRE teams, you'll focus on the specialized ML optimization challenges that require deep machine learning expertise.

GPU & Cost Optimization: Design optimal GPU configurations and ML deployment strategies to maximize performance while minimizing cloud costs.
ML Performance Tuning: Optimize model serving, memory management, and inference pipelines for production LLM workloads. You will also work with models and customize prompts, write pre- and post-processing methods to improve accuracy and speed (production coded), and implement new models functionality in the system.
DevOps Collaboration: Work with our Sr. DevOps/SRE teams to implement ML-specific solutions and monitoring

What We're Looking For

ML Infrastructure Optimization: 5+ years optimizing production ML systems with focus on GPU utilization and cost management
GPU & LLM Expertise: Deep understanding of GPU architectures, memory management, and LLM inference optimization
Python + DevOps Integration: Expert Python programming with experience working alongside DevOps/SRE teams on ML-specific solutions
Bonus: Experience at GPU-focused ML companies (SambaNova, NVIDIA, etc.) or with high-performance ML serving frameworks

Why Join Us

Collaborative Impact: Work with our existing Sr. DevOps and Sr. SRE teams to solve ML-specific challenges that require specialized expertise
Technical Depth: Focus purely on cutting-edge ML optimization problems without getting pulled into general infrastructure work
High Autonomy: Direct collaboration with engineering leadership in a fast-paced, technically rigorous environment
Competitive Package: Strong salary, equity, comprehensive benefits, and flexible hybrid work model

Bonfy.AI — Truth. Security. Intelligence.

This job is no longer accepting applications

See open jobs at Bonfy.AI.See open jobs similar to "Senior Machine Learning Operations (MLOps) Engineer" TLV Partners.

See more open positions at Bonfy.AI

Powered by Getro.com

Privacy policy Cookie policy