Location: Bangalore (ARTgarage)
idle Robotics is a Bengaluru-based startup with an ambitious mission: to become the intelligence layer powering all autonomous systems. We believe the future of AI is physical, and the first step is giving machines the ability to perceive the world. We are building a foundational layer of visual intelligence that is, in effect, the "visual cortex" for robots, allowing them to perceive, navigate, and act intelligently. Our dual-use approach means you will contribute to high-impact work, from GPS-denied navigation for defence drones to scalable software for global industrial automation. If you are passionate about robotics and computer vision and want to push the boundaries of Physical AI, we invite you to build the core technology with us.
As our Computer Vision Engineer, you will build classical vision pipelines, deep learning architectures, and foundation model adaptations for detection, segmentation, tracking, and 3D perception. You will optimise models for embedded and edge platforms, work with large datasets, and collaborate across robotics and system architecture teams to bring robust perception systems from prototype to field deployment.
Develop computer vision pipelines using image processing, feature extraction, and classical CV methods.
Build deep learning models for detection, segmentation, tracking, and 3D perception using CNNs, Transformers, and architectures such as YOLO, Faster R-CNN, Mask R-CNN, U-Net, and DeepLab.
Fine-tune and adapt foundation models such as SAM, CLIP, and DINO.
Optimise model performance for embedded and edge compute platforms.
Design and manage datasets, including cleaning, augmentation, and annotation using CVAT or Label Studio.
Run evaluation, profiling, and failure analysis on deployed models.
Collaborate with robotics and embedded teams to integrate perception outputs into navigation, control, and planning.
Maintain documentation, experiment logs, and deployment specifications.
Strong foundation in classical computer vision, geometry, and camera calibration
Proficiency in PyTorch or TensorFlow
Experience with YOLO, R-CNN family models, U-Net, or DeepLab
Proficiency in Python and core libraries such as OpenCV, NumPy, SciPy, TorchVision, and Albumentations
Strong math foundations, including linear algebra, calculus, and probability
Hands-on experience with dataset creation and annotation
Ability to write clean and modular code and work collaboratively
Experience with 3D vision, stereo, SLAM, or reconstruction
Familiarity with self-supervised or vision-language models
Experience using TensorRT, ONNX Runtime, or similar tools
C++ proficiency for performance-critical CV modules
Robotics experience integrating perception pipelines
Collaborate directly with IIT/IISc founders in a high-density engineering environment.
Solve complex "zero-to-one" problems in Physical AI and autonomous systems.
Develop dual-use technology for national defence (GPS-denied navigation) and industrial automation.
Receive substantial ESOPs and equity as a core team member.
Access competitive compensation, paid time off, and a growth-focused culture.
ARTPARK @ IISc: Innovation factory for next-gen robotics & AI
ARTPARK is India's leading deep-tech venture builder and incubator focused on robotics, connected autonomous systems, and AI. Leveraging our unique facilities and ecosystems, we strive to provide meaningful support to very early-stage startups building research-based deep-tech products. We are a nonprofit organization created by the Indian Institute of Science (IISc, Bengaluru) with support from the Department of Science & Technology (Government of India) and the Government of Karnataka.