Abstract: Object Recognition and Document Skew Estimation have come a long way in terms of performance and efficiency. New models follow one of two directions: improving performance using larger ...
A real-time face recognition-based attendance system built with Flask, OpenCV, and face_recognition. This project enables automatic attendance marking, user management, live monitoring, and ...
TonyPi AI humanoid robot brings Raspberry Pi 5 vision, voice control, and multimodal model integration to an 18-DOF education ...
Abstract: The consumption of image data by machines is rapidly increasing due to the growing adoption of image recognition technologies. This trend has accelerated research in image compression ...
InstructSAM is a training-free framework for Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS). We construct EarthInstruct, an InstructCDS benchmark for remote sensing.