Deep Learning Model Optimization Engineer
miniByte
Full-time
Rawalpindi, Punjab
Other-General
Posted:
March 03, 2026
Location:
Rawalpindi, Punjab, Pakistan
Job Description
About the Role
miniByte is hiring Deep Learning Model Optimization Engineers to build, train, and optimize state-of-the-art deep learning models for high-performance production deployment. The role sits at the intersection of research and systems engineering, with a strong focus on inference efficiency across GPUs and edge devices.
Key Responsibilities
- Design and implement deep learning models (CNNs, Transformers, hybrid architectures).
- Build scalable training pipelines and distributed training workflows.
- Apply model compression techniques: quantization, pruning, and knowledge distillation.
- Optimize inference using TensorRT, ONNX Runtime, OpenVINO, or TVM.
- Profile and analyze performance bottlenecks using GPU profiling tools.
- Develop custom CUDA/C++ kernels when required.
- Benchmark latency, throughput, and accuracy across hardware platforms.
- Collaborate on deployment using Triton Inference Server and c...
Apply for this Job
Submit your application for the Deep Learning Model Optimization Engineer position at miniByte.
Job Overview
Job Type:
Full-time
Location:
Rawalpindi, Punjab, Pakistan
Posted:
March 03, 2026
Deadline:
April 12, 2026