Supervision Quickstart¶
We write your reusable computer vision tools. Whether you need to load your dataset from your hard drive, draw detections on an image or video, or count how many detections are in a zone. You can count on us! 🤝
We hope that the resources in this notebook will help you get the most out of Supervision. Please browse the Supervision docs for details, raise an issue on GitHub for support, and join our discussions section for questions!
Table of contents¶
- Before you start
- Install
- Detection API
- Plug in your model
- YOLOv8 (
pip install ultralytics
) - Inference (
pip install inference
) - YOLO-NAS (
pip install super-gradients
)
- YOLOv8 (
- Annotate
BoxAnnotator
MaskAnnotator
LabelAnnotator
- Filter
- By index, index list and index slice
- By
class_id
- By
confidence
- By advanced logical condition
- Plug in your model
- Video API
VideoInfo
get_video_frames_generator
VideoSink
- Dataset API
DetectionDataset.from_yolo
- Visualize annotations
split
DetectionDataset.as_pascal_voc
⚡ Before you start¶
NOTE: In this notebook, we aim to show - among other things - how simple it is to integrate supervision
with popular object detection and instance segmentation libraries and frameworks. GPU access is optional but will certainly make the ride smoother.
Let's make sure that we have access to GPU. We can use nvidia-smi
command to do that. In case of any problems navigate to Edit
-> Notebook settings
-> Hardware accelerator
, set it to GPU
, and then click Save
.
!nvidia-smi
Wed Jul 17 14:51:30 2024 +---------------------------------------------------------------------------------------+ | NVIDIA-SMI 535.104.05 Driver Version: 535.104.05 CUDA Version: 12.2 | |-----------------------------------------+----------------------+----------------------+ | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |=========================================+======================+======================| | 0 NVIDIA L4 Off | 00000000:00:03.0 Off | 0 | | N/A 63C P8 14W / 72W | 1MiB / 23034MiB | 0% Default | | | | N/A | +-----------------------------------------+----------------------+----------------------+ +---------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=======================================================================================| | No running processes found | +---------------------------------------------------------------------------------------+
NOTE: To make it easier for us to manage datasets, images and models we create a HOME
constant.
import os
HOME = os.getcwd()
print(HOME)
/content
NOTE: During our demo, we will need some example images.
!mkdir {HOME}/images
NOTE: Feel free to use your images. Just make sure to put them into images
directory that we just created. ☝️
%cd {HOME}/images
!wget -q https://media.roboflow.com/notebooks/examples/dog.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-2.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-3.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-4.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-5.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-6.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-7.jpeg
!wget -q https://media.roboflow.com/notebooks/examples/dog-8.jpeg
/content/images
💻 Install¶
!pip install -q supervision
import supervision as sv
print(sv.__version__)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0.0/135.7 kB ? eta -:--:-- ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 135.7/135.7 kB 3.9 MB/s eta 0:00:00 0.22.0
👁️ Detection API¶
- xyxy
(np.ndarray)
: An array of shape(n, 4)
containing the bounding boxes coordinates in format[x1, y1, x2, y2]
- mask:
(Optional[np.ndarray])
: An array of shape(n, W, H)
containing the segmentation masks. - confidence
(Optional[np.ndarray])
: An array of shape(n,)
containing the confidence scores of the detections. - class_id
(Optional[np.ndarray])
: An array of shape(n,)
containing the class ids of the detections. - tracker_id
(Optional[np.ndarray])
: An array of shape(n,)
containing the tracker ids of the detections.
🔌 Plug in your model¶
NOTE: In our example, we will focus only on integration with YOLO-NAS and YOLOv8. However, keep in mind that supervision allows seamless integration with many other models like SAM, Transformers, and YOLOv5. You can learn more from our documentation.
import cv2
IMAGE_PATH = f"{HOME}/images/dog.jpeg"
image = cv2.imread(IMAGE_PATH)
!pip install -q ultralytics
from ultralytics import YOLO
model = YOLO("yolov8s.pt")
result = model(image, verbose=False)[0]
detections = sv.Detections.from_ultralytics(result)
"detections", len(detections)
('detections', 4)
!pip install -q inference
from inference import get_model
model = get_model(model_id="yolov8s-640")
result = model.infer(image)[0]
detections = sv.Detections.from_inference(result)
"detections", len(detections)
('detections', 4)
!pip install -q super-gradients
!pip install --upgrade urllib3
from super_gradients.training import models
model = models.get("yolo_nas_s", pretrained_weights="coco")
result = model.predict(image)
detections = sv.Detections.from_yolo_nas(result)
"detections", len(detections)
('detections', 7)
👩🎨 Annotate¶
from ultralytics import YOLO
model = YOLO("yolov8x.pt")
result = model(image, verbose=False)[0]
detections = sv.Detections.from_ultralytics(result)
box_annotator = sv.BoxAnnotator()
label_annotator = sv.LabelAnnotator()
annotated_image = image.copy()
annotated_image = box_annotator.annotate(annotated_image, detections=detections)
annotated_image = label_annotator.annotate(annotated_image, detections=detections)
sv.plot_image(image=annotated_image, size=(8, 8))