About Inferra
Inferra is a publication about the unglamorous, high-leverage part of machine learning: getting models out of notebooks and into the real world.
Most ML writing stops at training accuracy. We pick up where it leaves off: exporting models, running fast inference, deploying to the edge and to production, and keeping the resulting system maintainable. Every guide is written for engineers and ships with real, tested code.
What you'll find here
- Model portability and the ONNX ecosystem
- On-device and edge inference
- Production ML workflows and MLOps
- Practical automation for ML teams
Have a topic you want covered? Reach out. This publication is shaped by what readers are actually trying to ship.