Our General Models
DINO-XSeek
Referring object detection model based on a multimodal large language model, designed to precisely locate objects based on user-input natural language descriptions.
DINO-X
Supports open-world object detection and segmentation with text, visual, and prompt-free modes for precise outputs like bounding boxes, masks, keypoints, and captions.
Grounding DINO
Leverages text prompts to intelligently identify object locations and confidence, simplifying detection across common, long-tail, and dense scenarios.
T-Rex
Achieves object detection and counting through visual prompts without training, adaptable to various scenarios and excelling in dense and overlapping scenes.
Our Custom Services
By training on a small number of sample images, high-quality visual embeddings can be generated to enable accurate recognition and detection of specific targets. This capability is ideal for complex scenarios such as long-tail category identification, industrial customization, and non-standard object detection, empowering efficient business validation and deployment.
DINO-X Platform - Computer Vision Models for developers and enterprises