You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Human action classification system with pose-based (MediaPipe) and video-based (3D CNN) models. Features 100+ architectures for real-time pose classification and temporal models pretrained on UCF-101/HMDB51.
Custom YOLO11m model for detecting and classifying car body damage (99% shattered glass, 96% flat tire detection accuracy)—optimized for high-capacity inference and assistive use in inspection and service workflows like BMW pre-loaner inspections.
YOLO-TLP: detected and classified tiny objects with bounding box dimensions smaller than 15 pixels, outperforming other one-stage detectors. maximum resolution for target observation in real-time applications.
YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).
Explore a wide range of computer vision projects and documentation covering everything from object detection, image segmentation, and tracking to pose estimation, object counting, and automated annotation. These resources highlight real-world AI applications built with modern models like Ultralytics YOLO, Meta SAM 2, and other vlms
This project uses YOLO for real-time leukemia detection in blood samples and CNNs for classifying brain hemorrhages in MRI scans. It aims to support faster, more accurate medical diagnostics through deep learning.
A real-time multi-person human pose estimation system using TensorFlow MoveNet Multipose (Lightning). Built with OpenCV for video and webcam inference, it detects and visualizes keypoints and skeletal connections with confidence-based filtering, optimized for speed and multi-person scenarios.
An inline LLM firewall with a sub-10 ms p99 latency target — built in layers across five documented phases. Sits between your app and any LLM endpoint to classify, redact, or block threats in real time, then continuously retrains itself when drift is detected.
A Spatial Retrieval-Augmented Generation system for latent world models, designed for embodied spatial intelligence in robotics, autonomous navigation, and embodied AI. Features ROS2 integration, real-time inference @ 25Hz, and complete robot build guide.
Volleyball tracking - VballNet is a specialized deep learning framework designed for volleyball tracking, built upon the foundation of TrackNetV4. This repository includes two primary models, VballNetV1 and VballNetFastV1
THYX is an edge AI video analysis platform for AIoT, featuring behavior recognition, intelligent alerts, and real-time management. Modular, high-performance, and lightweight — ideal for smart security and industrial scenarios.
PlantAi is a ResNet-based CNN model trained on the PlantVillage dataset to classify plant leaf images as healthy or diseased. This repository includes PyTorch training code, tools to convert the model to TensorFlow Lite (TFLite) for deployment, and an Android app integrating the model for real-time leaf disease detection from camera images.
This project uses YOLO for real-time leukemia detection in blood samples and CNNs for classifying brain hemorrhages in MRI scans. It aims to support faster, more accurate medical diagnostics through deep learning.