OWL-ViT is a zero-shot text-conditioned object detection model | Tensorflow(@CVision)

OWL-ViT is a zero-shot, text-conditioned object detection model: you query an image with text descriptions of objects, including objects it was never trained to detect. It generalizes well and performs on par with some state-of-the-art object detection models.

Docs: https://huggingface.co/docs/transformers/main/en/model_doc/owlvit
Demo: https://huggingface.co/spaces/adirik/OWL-ViT
Tutorial: https://colab.research.google.com/github/huggingface/notebooks/blob/main/examples/zero
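
To make "querying images with text descriptions" concrete, here is a minimal sketch using the Hugging Face transformers OWL-ViT classes. It is an illustration, not taken from the linked tutorial. Assumptions: the checkpoint name google/owlvit-base-patch32, the helper names label_detections and detect (both hypothetical), and the 0.1 score threshold. The post-processing method name may differ depending on the installed transformers version, so check the docs linked above.

```python
from typing import Dict, List, Sequence


def label_detections(
    boxes: Sequence[Sequence[float]],
    scores: Sequence[float],
    labels: Sequence[int],
    queries: Sequence[str],
    min_score: float = 0.1,
) -> List[Dict]:
    """Attach the query text to each detection and drop low-confidence ones.

    Hypothetical helper: `labels` holds indices into `queries`.
    """
    kept = []
    for box, score, label in zip(boxes, scores, labels):
        if score >= min_score:
            kept.append({
                "query": queries[int(label)],
                "score": round(float(score), 3),
                "box": [round(float(v), 1) for v in box],
            })
    # Highest-confidence detections first.
    return sorted(kept, key=lambda d: d["score"], reverse=True)


def detect(image_path: str, queries: List[str], min_score: float = 0.1) -> List[Dict]:
    """Run OWL-ViT on one image. Needs torch, transformers, and Pillow installed."""
    # Imported lazily so the helper above works without these packages.
    import torch
    from PIL import Image
    from transformers import OwlViTForObjectDetection, OwlViTProcessor

    checkpoint = "google/owlvit-base-patch32"  # assumed checkpoint
    processor = OwlViTProcessor.from_pretrained(checkpoint)
    model = OwlViTForObjectDetection.from_pretrained(checkpoint)

    image = Image.open(image_path).convert("RGB")
    # One list of text queries per image in the batch.
    inputs = processor(text=[queries], images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # PIL reports (width, height); post-processing expects (height, width).
    target_sizes = torch.tensor([image.size[::-1]])
    # Method name is version-dependent; this one may be deprecated in newer releases.
    result = processor.post_process_object_detection(
        outputs=outputs, threshold=min_score, target_sizes=target_sizes
    )[0]
    return label_detections(
        result["boxes"].tolist(),
        result["scores"].tolist(),
        result["labels"].tolist(),
        queries,
        min_score,
    )
```

Calling detect("photo.jpg", ["a photo of a cat", "a photo of a dog"]) on a local image (the file name is a placeholder) would return a list of dictionaries, each with the matching query text, a score, and a box in pixel coordinates. Because the queries are free text, changing what the model looks for means changing the strings, with no retraining.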

