Deep Learning Model Optimization Engineer

miniByte
Full-time Rawalpindi, پنجاب Other-General
Posted:
March 03, 2026
Location:
Rawalpindi, پنجاب, Pakistan

Job Description

About the Role
is hiring Deep Learning Model Optimization Engineers to build, train, and optimize state-of-the-art deep learning models for high-performance production deployment. This role sits at the intersection of research and systems engineering, with a strong focus on inference efficiency across GPUs and edge devices.
Key Responsibilities
Design and implement deep learning models (CNNs, Transformers, hybrid architectures).
Build scalable training pipelines and distributed training workflows.
Apply model compression techniques: quantization, pruning, and knowledge distillation.
Optimize inference using TensorRT, ONNX Runtime, OpenVINO, or TVM.
Profile and analyze performance bottlenecks using GPU profiling tools.
Develop custom CUDA/C++ kernels when required.
Benchmark latency, throughput, and accuracy across hardware platforms.
Collaborate on deployment using Triton Inference Server and containerized environments.
Requirements
Master's degree i...

Apply for this Job

Submit your application for the Deep Learning Model Optimization Engineer position at miniByte.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: Rawalpindi, Pakistan
Posted: March 03, 2026
Deadline: April 12, 2026