    1,086 yolo jobs found

    I need an engaging, interactive workshop for college students on personal finance. The workshop should cover: - Budgeting and Saving - Investing Basics - Debt Management - What not to do as a college student (FOMO, YOLO, etc.) - Financial habits - How to choose the right job - Which field to focus on - Concept of a personal balance sheet - What to focus on: savings vs investment - FU Money - How much to earn vs how much to save - Concept of financial freedom - Savings instruments - Salary negotiations: higher fixed vs higher variable - Longevity of job vs risky high-paying job. Suggest an appropriate session topic for the same. The workshop should last 1 hour and be designed to actively involve students, encouraging participation and discussion. Ideal skills and experience: - Experience in personal finance e...

    $11 Average bid
    2 bids

    I want a single, turnkey application that watches my CCTV feeds, spots shop-lifters in real time, recognises grocery products on the shelves, and keeps a live head-count of customers. The core model must be YOLO, and I need the exact same code-base to compile and run on both Windows (desktop with NVIDIA GPU) and a Raspberry Pi 4. Video sources vary—some cameras stream RTSP over IP while a few older analog units reach the NVR through a capture card—so the program has to accept either type without manual re-configuration. For product recognition I care only about groceries; no clothing or electronics labelling is necessary. The model should be trained (or fine-tuned) on the most common supermarket items so false positives stay low even when shelves are crowded. Key ...

    $725 Average bid
    131 bids

    ...straight into a MySQL database. Each stored image must be linked to the corresponding frame number and any detection metadata so I can later query, filter, and analyse the results. Once the data is stored, I want a lightweight viewer that steps through the saved frames in order, overlaying the detection boxes so I can visually confirm accuracy. OpenCV for frame extraction and display is acceptable; YOLO, TensorFlow, or another modern model is fine so long as the code is clean, well-commented, and easy for me to retrain with additional classes. To keep the hand-off smooth, please include: • A self-contained Python 3 script (or module set) that performs detection, inserts frames into MySQL, and plays them back. • The SQL schema and sample data script. • A brief...

    $300 Average bid
    31 bids

    ...dispatched quickly. I’m flexible about the imagery source—NASA, ESA, Google Earth, or any other free feed is fine as long as it delivers cloud-free, high-resolution scenes. You can use the tool to capture screenshots by moving in circles around the selected location. The detector has to work at desert scale, so please build it with an established computer-vision framework (e.g., TensorFlow, PyTorch, YOLO, or a similarly robust model) and output the findings in both human-readable (an image with bounding boxes or a simple web map) and machine-readable form (CSV/GeoJSON with lat/long, time stamp, confidence score). Once I apply the tool to a new location and receive a list of car and truck pictures and coordinates automatically reflected on the map, no manual clicks...

    $205 Average bid
    87 bids

    ...the head centered) • Auto zoom / smart cropping (keeping head size consistent) • Smooth motion and transition logic At this stage, we do not yet have final hardware or SDK access, so we are asking the freelancer to: Phase 1 – Prototype (on your own hardware) • Build a working demo using your own Android device / webcam / test camera • Use open-source tools (MediaPipe / TensorFlow Lite / OpenCV / YOLO, etc.) • Demonstrate: 1. Live head tracking 2. Auto zoom in real time 3. Stable performance (≥15 FPS) Deliverables • Full source code • Build instructions • Short demo video showing real-time performance • Explanation of how this will later integrate with a custom camera SDK Once validated, this prototype will be integrated i...

    $111 Average bid
    76 bids

    ...generation—all with strict patient privacy, no storage of originals, and human oversight required. Key Requirements: • Clean React/ frontend with drag-and-drop upload, DICOM viewer (e.g., ), annotation overlays & heatmaps. • Python backend (FastAPI preferred) + secure auth, encrypted file handling, and cloud storage (AWS S3/GCP). • PyTorch/TensorFlow ML models (fine-tune YOLO/U-Net/MONAI on open dental datasets) for multi-label detection/segmentation. • Mandatory: Full anonymization on upload (pydicom/deid), end-to-end encryption, audit logs, compliance-ready (HIPAA/GDPR/APP principles), ethical transparency (e.g., explainability features). • Cloud deployment (AWS/GCP/Azure, serverless ideal). NDA required. Bid with experience in medica...

    $407 Average bid
    206 bids

    ... and push actionable alerts through a cloud-hosted pipeline. Later phases will weave in IoT sensors, GPS data and public-transport schedules, but for now cameras take centre stage and all processing happens in the cloud. Here’s what I need from you: • A complete stream-to-insight workflow: camera feed ingestion → cloud message bus → analytics micro-service. • Computer-vision models (OpenCV, YOLO, TensorFlow—your call) that flag incidents with >90 % precision/recall on my test clips. • A REST API that surfaces live traffic state and returns diversion routes in real time. • Extension hooks so I can sync bus, train and metro timetables and forward delay alerts to commuters. • Containerised or serverless deployment scripts s...

    $42962 Average bid
    21 bids
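
The ">90 % precision/recall" target in the listing above can be checked with a few lines of Python. This is only a minimal sketch: the function names are hypothetical, and it assumes incidents on the test clips have already been counted as true positives, false positives and false negatives (the listing does not say how matches are decided).

```python
def precision_recall(true_pos, false_pos, false_neg):
    """Return (precision, recall) from incident-level counts."""
    predicted = true_pos + false_pos   # everything the model flagged
    actual = true_pos + false_neg      # everything that really happened
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    return precision, recall


def meets_target(true_pos, false_pos, false_neg, target=0.90):
    """True only if both precision and recall are strictly above the target."""
    precision, recall = precision_recall(true_pos, false_pos, false_neg)
    return precision > target and recall > target
```

For example, `meets_target(95, 5, 4)` passes, while a model with exactly 90 % precision does not, because the listing asks for more than 90 %.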

    I already have a working Python pipeline that watches a live game feed, runs it through YOLO to carve out short MP4 snippets, and pushes those clips to a monitor in real time. A Flutter front-end then lets fans pull either single clips or concatenated highlight reels to their iPhone or Android devices. In addition, the YOLO feed provides information for other performance calculations that post under each snippet on the monitor. Two pain points are slowing us down: 1. YOLO interference: Right now, one key smaller object interferes with other objects that are detected because there is no prioritization layer in place. This causes some of the performance calculations to be intermittent, resulting in some incomplete calculations. I need logic that decides, on the ...

    $33 / hr Average bid
    222 bids

    ...is **not optional**. You must design the **AI brain** of the system, including: #### AI Methods * Computer Vision (façade detection, cracks, dirt, dimensions) * SLAM / Visual-Inertial Odometry * Object detection & segmentation * Path planning & autonomous navigation * Reinforcement Learning or Rule-Based Control * Predictive maintenance models #### Software & Coding * AI model architecture (YOLO / Transformer / CNN etc.) * Training data requirements * Edge AI vs Cloud AI decision * APIs & system architecture * Simulation tools (Gazebo, AirSim, ROS2) * Control logic (PX4 / ArduPilot) ⚠️ **You must explain the logic, workflows, and pseudo-code or real code structure.** --- ### 4 Costing & Commercial Feasibility You will deliver: * Prototype cost...

    $2249 Average bid
    164 bids

    ..."narcotics," "arrest," "isolate yourself"). * Flag high-pressure/threatening tones. * Visual Forensics (Computer Vision): * Liveness/Deepfake Detection: Identify if the face in the video is AI-generated (looking for lack of blinking, lip-sync errors, or artifacts). * Uniform/Badge Recognition: Detect if the person is wearing a police uniform or showing a badge (using object detection like YOLO). * Real-Time Risk Dashboard: * A simple UI that displays a "Trust Score." If the score drops below a threshold, it shows a "SCAM ALERT" warning. Preferred Tech Stack: * Language: Python * ML Frameworks: TensorFlow / PyTorch / Keras * Computer Vision: OpenCV, MediaPipe * NLP: Hugging Face Transformers (BERT/RoBERTa for inte...

    $117 Average bid
    16 bids

    ...hardware required) Optimize for snap, lock, weld, and track behavior with precise joystick control Debug latency, drift, misalignment, and tracking edge cases Required Experience (Must Have) Cronus Zen experience (scripts, HID behavior, real-world tuning) Ownership of Xbox or PlayStation console for live testing Microcontroller experience (Arduino-class or higher) Strong background in computer vision (YOLO or similar real-time detection pipelines) Experience with transparent overlays (Windows overlay windows, layered rendering, etc.) vGamepad / XInput / virtual controller experience Comfortable debugging timing, latency, and signal stability Strongly Preferred Experience with real-time CV + joystick automation Familiarity with Python, C++, or C# in CV or input systems Understandi...

    $488 Average bid
    80 bids

    ...Android developer to help build AI-powered mobile applications. The work involves integrating machine learning models for real-time detection and analysis, both running locally on-device (offline) and via cloud APIs. Note: Your task will be the integration of AI models into the app architecture, but it's nice to have knowledge of AI models. What You'll Be Working On: • Integrating custom detection models (YOLO, TensorFlow Lite, ONNX) into Android apps • Implementing on-device inference for offline functionality • Connecting apps to AI APIs for cloud-based processing • Optimizing model performance for Android mobiles (Snapdragon 6 CPUs etc.) • Building clean UI to display detection results in real time. Required Skills: • Strong Android development experience (Kotlin/Java/Flutter) • Hands-on exp...

    $21 / hr Average bid
    174 bids

    I need a piece of software that plugs straight into the video stream coming from existing, off-the-shelf CCTV cameras and immediately adds deep-learning smarts. The core ...Deliverables • Installable software (source + compiled package) that connects to standard RTSP/ONVIF camera feeds • Model training or transfer-learning pipeline that achieves high accuracy on my sample footage • Real-time alert module covering email, SMS, in-app and monitor popups • Setup guide and brief user manual I’m comfortable if you leverage frameworks such as Python, OpenCV, TensorFlow or YOLO, as long as performance remains near real-time on 1080p streams. Let me know your approach, estimated turnaround time and any prerequisites you’ll need from my side (e.g., l...

    $22 / hr Average bid
    57 bids

    Healthcare AI Chatbot with Document Detection and CCTV Emergency Monitor...Fire or smoke Crowd congestion Abnormal situations: Person lying on floor Sudden group gathering Visual Output Bounding boxes for people Circles or highlights for danger zones Annotated frames Chatbot Alerts Text-based alerts inside chat: “Crowd detected in corridor” “Fire detected near patient area” Tech Stack (Expected) Backend: Python Computer Vision: OpenCV Detection Models: Pretrained YOLO OCR: Tesseract Tracking: SORT or centroid tracking Chatbot: Rule-based or LLM API Output: JSON + annotated images Deliverables Appointment booking chatbot logic Document detection and OCR pipeline CCTV analysis module Emergency detection rules Chatbot responses for all modules Source code ...

    $72 Average bid
    11 bids

    ... and push actionable alerts through a cloud-hosted pipeline. Later phases will weave in IoT sensors, GPS data and public-transport schedules, but for now cameras take centre stage and all processing happens in the cloud. Here’s what I need from you: • A complete stream-to-insight workflow: camera feed ingestion → cloud message bus → analytics micro-service. • Computer-vision models (OpenCV, YOLO, TensorFlow—your call) that flag incidents with >90 % precision/recall on my test clips. • A REST API that surfaces live traffic state and returns diversion routes in real time. • Extension hooks so I can sync bus, train and metro timetables and forward delay alerts to commuters. • Containerised or serverless deployment scripts s...

    $42782 Average bid
    37 bids

    ... and push actionable alerts through a cloud-hosted pipeline. Later phases will weave in IoT sensors, GPS data and public-transport schedules, but for now cameras take centre stage and all processing happens in the cloud. Here’s what I need from you: • A complete stream-to-insight workflow: camera feed ingestion → cloud message bus → analytics micro-service. • Computer-vision models (OpenCV, YOLO, TensorFlow—your call) that flag incidents with >90 % precision/recall on my test clips. • A REST API that surfaces live traffic state and returns diversion routes in real time. • Extension hooks so I can sync bus, train and metro timetables and forward delay alerts to commuters. • Containerised or serverless deployment scripts s...

    $29 Average bid
    10 bids

    I want a desktop-based vision solution that watches any TV, computer monitor, or public display in real time and notifies me the moment someone steps in front of it and tries to take a photo. The core may rely on YOLO or straight OpenCV or Any—whichever gives the fastest, most reliable detection. How it should work • A camera connected to my desktop continuously analyses the scene. • When a person holding up a phone (or clearly preparing to photograph) is detected, the system must: – Crop the frame (or short clip) so the person is centred. – Timestamp it. – Dispatch the alert simultaneously to Telegram, WhatsApp, and Email or Any platform. Key expectations • Sub-second latency between detection and alert. • Clean, wel...

    $10 / hr Average bid
    20 bids

    I want a desktop-based vision solution that watches any TV, computer monitor, or public display in real time and notifies me the moment someone steps in front of it and tries to take a photo. The core may rely on YOLO or straight OpenCV or Any—whichever gives the fastest, most reliable detection. How it should work • A camera connected to my desktop continuously analyses the scene. • When a person holding up a phone (or clearly preparing to photograph) is detected, the system must: – Crop the frame (or short clip) so the person is centred. – Timestamp it. – Dispatch the alert simultaneously to Telegram, WhatsApp, and Email or Any platform. Key expectations • Sub-second latency between detection and alert. • Clean, wel...

    $1374 Average bid
    46 bids

    My in-house Video Management System already ingests live H.264 streams from multiple IP cameras positioned around large industrial sites. The next step is to embed two real-time analytics modules directly into th...for now—just accurate detection and high-quality face capture that I can archive or pass to other systems later. Acceptance criteria 1. ≥95 % plate read accuracy on my provided test set of Indian vehicles. 2. Face box IoU ≥0.8 against ground-truth on the same streams. 3. End-to-end latency (frame in ➜ metadata out) ≤300 ms at 1080p30. If you have prior deployments of OpenCV + TensorRT, YOLO-based detectors, EasyOCR, PaddleOCR or similar on Indian road footage, mention them when you respond; sample screenshots or short demo clips will help me sho...

    $19 / hr Average bid
    16 bids
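
The "Face box IoU ≥0.8" acceptance criterion in the listing above refers to intersection over union between a predicted box and its ground-truth box. A minimal sketch follows, assuming boxes are given as `(x1, y1, x2, y2)` pixel corners; the box layout and the function name are assumptions, since the listing does not specify a format.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    # Corners of the overlapping rectangle, if any.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero so disjoint boxes give no intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A predicted face box would pass the criterion when `iou(predicted, ground_truth) >= 0.8`.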

    I have a mixed set of still photos and short video cl...care as much about keeping a unique ID on each instance across frames as I do about tight, well-placed boxes or polygons. You will receive the raw media in batches. For each image or video frame, mark every visible person, vehicle and animal, assign the correct class label, then keep the same ID on that object until it leaves the scene. I’m happy with standard output formats such as COCO JSON, YOLO TXT, or a simple CSV so long as coordinates, class names and tracking IDs are included. Before we start the full set, I’ll provide a small sample; your annotations on that sample will serve as the acceptance test for quality and consistency. Once approved, you can move on to the remaining files and deliver them in agree...

    $12 / hr Average bid
    13 bids

    Practical AI tooling for defect detection in a manufacturing use case

    $150 Average bid
    1 bid

    • build a realistic-looking synthetic dataset from our video, • generate controlled synthetic defects on top of clean frames, • train a detector and see how far we can get using synthetic-only or mostly synthetic data.

    $80 Average bid
    1 bid

    ...clip of the incident 20s for example. Channels: Integration with WhatsApp API (e.g., Twilio or Meta API) OR a custom Mobile App (Flutter/React Native) for security staff. Dashboard: A simple web-based or local interface to view live logs, replay detected incidents, and manage sensitivity settings. Technical Requirements: Programming Language: Python. Frameworks: PyTorch, TensorFlow, OpenCV, YOLO (v8/v10), or MediaPipe. Hardware Compatibility: Must be optimized for NVIDIA CUDA cores / TensorRT. Scalability: The code should support multiple camera streams simultaneously. Deliverables: Full Source Code (well-documented). Setup Guide (How to install on the NVIDIA device and connect cameras). A working prototype/MVP demonstrating the detection of basic theft actions. Ideal...

    $577 Average bid
    100 bids

    ...new bullet hole, and then calculates the shooter’s score according to standard concentric-ring values. Once the session ends, the system should compile everything into a clear, detailed report that can be downloaded. Scope of work • Detect the target board in every frame, even if lighting changes or smoke drifts across the lens. • Technical preferences I am comfortable with Python, so OpenCV, YOLO, TensorFlow, or a similar stack will fit well. If you prefer another language or library, just make sure the final solution runs on Windows 10 and can be triggered by a simple command or scheduled task. Deliverables 1. Well-commented source code and any trained weight files. 2. A small sample dataset for verification. 3. Step-by-step setup guide. 4. Example ou...

    $320 Average bid
    12 bids
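
The scoring step in the listing above (a score per bullet hole "according to standard concentric-ring values") could look roughly like the sketch below. Everything specific here is an assumption for illustration: equal-width rings measured in pixels, a maximum of 10 points at the centre, and one point lost per ring outward. Real target rules (ring sizes, line-touch rules) would have to come from the client.

```python
import math


def ring_score(hit_xy, center_xy, ring_width_px, max_score=10):
    """Score a hit by how many equal-width rings it lies from the centre."""
    dist = math.hypot(hit_xy[0] - center_xy[0], hit_xy[1] - center_xy[1])
    rings_out = int(dist // ring_width_px)  # 0 for the innermost ring
    return max(0, max_score - rings_out)    # never below zero


def session_total(hits, center_xy, ring_width_px):
    """Sum the scores of all detected bullet holes in a session."""
    return sum(ring_score(h, center_xy, ring_width_px) for h in hits)
```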

    I’m putting together a real-time inspection line that must catch both subtle textural defects and broader anomalies as products move past a USB industrial camera. The vision stack is Python-based, with YOLO as the starting architecture in PyTorch, and the finished model needs to run on an NVIDIA Jetson under tight latency constraints. Here’s the core flow I need help completing: continuous frame acquisition through the camera’s SDK (not a generic webcam wrapper), on-device preprocessing, YOLO inference accelerated with TensorRT, and immediate feedback—either through GPIO or a lightweight web dashboard—so operators can see the live image, bounding boxes, confidence scores, and flag any misses. Training scripts should let me fine-tune the...

    $189 Average bid
    NDA
    11 bids

    ...Web page (HTML/CSS/JS or a simple framework like Flask, Django, Node.js, etc.) - Google Maps integration with API key placeholder - Documentation for setup and usage --- ### **5. Skills Required** - Python (PyTorch, Ultralytics YOLO) - Machine Learning / Computer Vision - Web development (basic front‑end + simple backend) - Experience with Google Maps API - Metadata extraction (EXIF, GPS tags) --- ### **6. Additional Notes** - Dataset will be provided after project award. - Clean, well‑commented code is required. - Preference for someone who has worked with YOLO models before. --- ...

    $368 Average bid
    181 bids

    I need a lightweight mobile application that can automatically locate faces in both photos and live video, running smoothly on iOS and Android. The core detector must be YOLO; however, you’re free to combine it with OpenCV or MediaPipe for preprocessing or post-processing as long as overall inference stays fast on mid-range devices. The front end can be built in Flutter or React Native—whichever lets you reach 60 fps video preview without draining the battery. A minimal, clean UI is enough: pick an image from the gallery, record or stream video, and immediately see bounding boxes appear in real time. You will train or fine-tune the model yourself, document the training pipeline, and include the final weights in the repo. Optimisation for size and speed (quantisat...

    $158 Average bid
    39 bids

    ...complete, camera-ready body-emotion detection pipeline that runs in real time on a standard classroom CCTV feed. The end goal is to monitor student engagement continuously, flagging moments of happiness, surprise, confusion, fear, frustration and closely related states so that teaching staff can react while the lesson is still in progress. Scope of work • Model exploration: start with CNN-BiLSTM and YOLO-based approaches, experiment with any other architecture you feel can outperform them, then select and justify the single best model for live deployment. • Data: restrict training to publicly available body-language datasets (for instance, CMU Panoptic, PKU-MMD, Kinetics-Skeleton, or similar) plus any augmentations you create yourself. No private data collection wi...

    $39 Average bid
    25 bids

    ...for too long By tracking hand, head and eye direction the system should outperform human observers and keep false positives low. It will be deployed across supermarkets, boutique stores, electronics stores and must remain adaptable to any other retail layout without requiring new cameras. The workflow I have in mind: incoming feed → OpenCV (or equivalent) preprocessing → deep-learning model (YOLO, Detectron2, TensorFlow/PyTorch—whichever you favour) → real-time overlay that highlights the suspect and pops a visual alert in the guard’s monitoring software. No audible alarm is needed right now. Deliverables 1. Trained model and reproducible training pipeline 2. Low-latency inference service that runs on standard Windows or Linux hardware 3. Ope...

    $162 Average bid
    42 bids

    I need an end-to-end solution that lets growers upload aerial JPEG, PNG or TIFF shots taken from drones and immediately receive two key pieces of information: the total number of pineapples visible in each image and the average radius of every fruit detected. The work breaks down into three tightly-linked components: 1. Model creation • Build and train a computer-vision model (YOLO, Detectron, Mask-RCNN or any comparable deep-learning stack you prefer in PyTorch or TensorFlow) that reliably identifies individual pineapples from top-down drone imagery. • Include a second head or post-processing step that converts pixel dimensions to real-world radius, using the drone’s altitude and sensor data or a calibration grid that I will supply. • Deli...

    $434 Average bid
    91 bids
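
The pixel-to-radius step in the pineapple listing above ("using the drone’s altitude and sensor data") is commonly done through the ground sampling distance (GSD). The sketch below is one possible approach, not the client's method: it assumes a camera pointing straight down over flat ground, and all function and parameter names are hypothetical.

```python
def ground_sampling_distance(altitude_m, sensor_width_mm,
                             focal_length_mm, image_width_px):
    """Metres on the ground covered by one pixel (nadir view, flat terrain)."""
    # The mm units of sensor width and focal length cancel out.
    return (altitude_m * sensor_width_mm) / (focal_length_mm * image_width_px)


def mean_radius_m(radii_px, gsd_m_per_px):
    """Average fruit radius in metres from per-fruit radii in pixels."""
    if not radii_px:
        return 0.0
    return sum(radii_px) / len(radii_px) * gsd_m_per_px
```

With a calibration grid instead of altitude data, the GSD would simply be the known grid spacing divided by its measured length in pixels.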

    ...adult students (20+). The task is to build two separate pipelines that estimate emotion from both facial cues and body language, then benchmark their performance: • Pipeline 1 – CNN-BiLSTM: extract spatial features frame-by-frame, pass temporal sequences through a BiLSTM, and classify the five basic emotions (happiness, sadness, anger, fear, surprise). • Pipeline 2 – YOLO-based: detect faces and upper-body regions with YOLO, feed cropped sequences to an emotion head, and output the same five-class predictions. After training on the videos, compare accuracy, F1, and real-time throughput. Once the stronger approach is confirmed, deploy it for live testing (webcam or external camera) so emotions can be displayed or logged in real time. Deliverables m...

    $22 Average bid
    14 bids

    ...to track 100+ equine keypoints with sub-pixel precision. Temporal Analysis: Develop spatio-temporal understanding of the jump phases (Approach, Take-off, Flight, Landing) using VideoMAE or TimeSformer. Data Correlation: Translate biomechanical outputs into performance scores and predictive potential markers. Required Technical Stack Languages/Frameworks: Python, PyTorch (Advanced). Vision: YOLO v8-v11, RT-DETR, Ultralytics, DeepLabCut, SLEAP. Spatio-Temporal: VideoMAE, Video Swin Transformer, TimeSformer. Deployment/Optimization: ONNX Runtime, TensorRT, MLOps (Weights & Biases / MLflow). Mandatory Candidate Qualifications We are looking for a top-tier expert. Please only apply if you meet the following criteria: Serious References: You must provide verifiable case s...

    $309 Average bid
    6 bids

    ...to track 100+ equine keypoints with sub-pixel precision. Temporal Analysis: Develop spatio-temporal understanding of the jump phases (Approach, Take-off, Flight, Landing) using VideoMAE or TimeSformer. Data Correlation: Translate biomechanical outputs into performance scores and predictive potential markers. Required Technical Stack Languages/Frameworks: Python, PyTorch (Advanced). Vision: YOLO v8-v11, RT-DETR, Ultralytics, DeepLabCut, SLEAP. Spatio-Temporal: VideoMAE, Video Swin Transformer, TimeSformer. Deployment/Optimization: ONNX Runtime, TensorRT, MLOps (Weights & Biases / MLflow). Mandatory Candidate Qualifications We are looking for a top-tier expert. Please only apply if you meet the following criteria: Serious References: You must provide verifiable case s...

    $11 / hr Average bid
    4 bids

    ...AI-enabled cameras • Analyze time spent on work-related and non-work-related activities • Provide visual evidence (captured images) with AI-generated comments • Generate accurate time-based reports for management review Key Features 1. Computer Vision & AI Analysis • Real-time video processing using PyTorch-based deep learning models • Human detection and tracking using models such as: • YOLO / Faster R-CNN (object detection) • DeepSORT (person tracking) • Action recognition models (CNN + LSTM / Transformers) • Face and posture recognition (optional and configurable) 2. Behavior Classification The system classifies employee behavior into predefined categories, such as: • Working at desk • Coffee break • ...

    $4143 Average bid
    89 bids

    ...them must also be supplied in JSON so I can generate either static or interactive charts later. Key statistics that must appear in the output: 1. Player positions and movement traces throughout the match 2. Types of shots taken and whether they resulted in winners, forced errors or unforced errors 3. Rally durations paired with their outcomes Technology preferences are Python with OpenCV, YOLO-based detection, pose estimation for finer tracking, and GPU-accelerated processing on AWS or GCP (or a local CUDA setup if you prefer). A clean, well-documented codebase and brief setup script are part of the hand-off. When you reply, please show: • Examples of previous computer-vision or sports-analytics projects you’ve delivered • A concise outline of the approac...

    $5041 Average bid
    63 bids

    I need a program that can automatically identify the walking direction of a horse in yearling parade videos, detect the frame where the horse is most perpendicular to the camera, and extract a standardised clip around that moment. Requirements: - Process MP4 yearling parade videos - Use YOLO to detect the horse and SAM2 (or YOLO-seg) to generate a segmentation mask for each frame - Calculate the mask centroid per frame and determine walking direction based on consistent left-to-right X-movement - Select only the frames where the horse is clearly walking left-to-right - From those frames, detect the moment where the horse is most perpendicular to the camera - Extract a standardised segment around this point (e.g., ±75 frames or ±2 seconds depending on FPS) - Exc...

    $277 Average bid
    45 bids
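
The centroid and direction steps in the horse-parade listing above can be sketched with NumPy. This is an illustration under stated assumptions: masks are boolean arrays already produced by SAM2 or YOLO-seg, and the thresholds `min_fraction` and `min_step_px` are hypothetical tuning values, not part of the listing.

```python
import numpy as np


def mask_centroid(mask):
    """Return the (x, y) centroid of a boolean mask, or None if it is empty."""
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        return None
    return float(xs.mean()), float(ys.mean())


def walking_direction(centroid_xs, min_fraction=0.8, min_step_px=0.5):
    """Classify direction from per-frame centroid X positions.

    A direction counts as consistent when at least `min_fraction` of the
    frame-to-frame steps move that way by more than `min_step_px`.
    """
    dx = np.diff(np.asarray(centroid_xs, dtype=float))
    if dx.size == 0:
        return "unknown"
    if np.mean(dx > min_step_px) >= min_fraction:
        return "left_to_right"
    if np.mean(dx < -min_step_px) >= min_fraction:
        return "right_to_left"
    return "unknown"
```

Only frame runs classified as `"left_to_right"` would go on to the perpendicular-pose search described in the listing.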

    ...shotgun apes, bow/sniper swaps, super tracking) • Call of Duty (Warzone/MW3/BO6 – bloom control, perfect movement tracking) • Apex Legends (legend ability detection, ultimate tracking, zip-line prediction) • Battlefield 6 (2042 & next BF – vehicle lock-on, massive player count handling) • Valorant (agent outlines, spike timer sync) • XDefiant + every new shooter that drops Aim Features (all games) • YOLO head/body detection running on Helios 2 • Rage / Legit / Silent / Triggerbot / Memory aim • Humanized smoothing, random offset, bezier curves • Live recoil tables per weapon (drag-drop for noobs, raw values for pros) • FOV circle, priority (closest/head/low HP), bone selection • Prediction for movement, slidi...

    $809 Average bid
    67 bids

    ...through Frigate (or your proposed alternative) with detections visualised in its UI. 2. n8n receives distinct webhook calls for configured event type, carrying JSON with camera name, event label, and clip URL. 3. Face library is configurable via a simple folder or API. 4. A short README explains how to retrain models, add cameras, or extend n8n flows. If you’ve worked with Frigate, TensorFlow, YOLO, OpenCV, or similar stacks and can demonstrate a working PoC quickly, let’s talk. Your job is to setup, configure, document all of above and help us build, recreate the same using documentation at our end. Also, to fix any issue that is needed to meet above requirements. Acceptance Criteria: Clean repository with clear build/run instructions. Application should w...

    $309 Average bid
    16 bids

    I need a reliable partner to go through my image set and draw tight, well-aligned bounding boxes around two classes only—People and Vehicles. Every image must be reviewed, the boxes placed accurately, and the resulting coordinates exported in a consistent file I can feed straight into my training pipeline (JSON, XML or YOLO txt—let me know which you prefer, I’m flexible as long as the formatting is uniform). Deliverables • All original images unchanged • A matching annotation file for each image or one consolidated file, clearly mapping every bounding box to its class label (“person”, “vehicle”). • A quick spot-check report summarising total images processed and any edge cases you encountered. Quality is far more impor...

    $252 Average bid
    17 bids
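
For the "YOLO txt" export option mentioned in the annotation listing above, each box becomes one text line: a class index followed by the box centre, width and height, all normalised by the image size. A minimal sketch, assuming boxes are drawn as pixel corners and that the class indices are 0 for "person" and 1 for "vehicle" (that mapping is an assumption for illustration):

```python
CLASS_IDS = {"person": 0, "vehicle": 1}  # hypothetical mapping


def to_yolo_line(label, x1, y1, x2, y2, img_w, img_h):
    """Convert a pixel-corner box to a normalised YOLO txt line."""
    x_center = (x1 + x2) / 2 / img_w
    y_center = (y1 + y2) / 2 / img_h
    width = (x2 - x1) / img_w
    height = (y2 - y1) / img_h
    return (f"{CLASS_IDS[label]} {x_center:.6f} {y_center:.6f} "
            f"{width:.6f} {height:.6f}")
```

For a 400×500 image, `to_yolo_line("person", 100, 50, 300, 250, 400, 500)` gives `0 0.500000 0.300000 0.500000 0.400000`; one such line per box goes into a `.txt` file named after the image.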

    ...closely with product teams for roadmap and feature design REQUIRED TECHNICAL EXPERTISE LLM & NLP: • GPT, LLaMA, Claude, Gemini • RAG pipelines, embeddings, summarization Voice AI: • TTS (Azure, ElevenLabs, Coqui) • ASR (Whisper, NeMo) • SSML, voice cloning, audio DSP Video & Avatar AI: • FFmpeg automation, OpenCV, Whisper • Wav2Lip, SyncNet, avatar generation (D-ID, Synthesia) Computer Vision: • YOLO models, segmentation, OCR, moderation filters MLOps & Architecture: • Kubernetes, Docker, FastAPI • Model serving (Triton, TorchServe) • Vector DBs (Pinecone, Weaviate, FAISS) • Airflow, Temporal, CI/CD Backend Systems: • Distributed systems, microservices • REST & WebSocket services • AW...

    $1916 Average bid
    7 bids

    ...(REQUIRED) AI Clothing Recognition When a user photographs an item, the system should: Automatically remove background Detect item category (hoodie, t-shirt, jeans, blazer, etc.) Identify color, patterns, shapes, textures Detect attributes such as neckline, sleeve type, length Classify overall style (casual, formal, streetwear, business, etc.) AI technologies may include: Segment Anything, YOLO, CLIP, custom CV models, Google Vision, or API integrations. Smart Digital Wardrobe A complete closet interface where all identified clothing items appear with: Categories & tags Filters (color, style, season, frequency, category, etc.) Search Usage statistics (least used items, favorites, recently worn) Clean, premium UI/UX Body Avatar Creation (From 3 Photos) Using...

    $21 / hr Average bid
    143 bids

    ...platform for real-time video analytics. The system will ingest RTSP/ONVIF camera streams, run real-time AI detection (person, vehicle, intrusion, loitering, unattended objects), generate alerts, store snapshots/clips, and provide dashboards, reports, and forensic search. A detailed FRD is ready. Required Skills: Real-time video processing (RTSP, GStreamer, FFMPEG, ONVIF) AI/Computer Vision (YOLO, TensorRT, DeepStream, OpenVINO, CUDA) GPU-accelerated inference pipelines Multi-tenant SaaS backend (Node.js / Python / Go) Cloud deployment (AWS/GCP) Databases: PostgreSQL/MongoDB, Redis Frontend: React or Vue Experience building similar video analytics systems is mandatory Who Should Apply DO NOT APPLY if you don’t have previous experience in video analytics / AI surveillan...

    $8443 Average bid
    22 bids

    ...machine-learning knowledge into a fully working, camera-driven inspection station for printed-circuit boards. The goal is straightforward: from a live feed you will detect wrong resistor values via colour-code reading, spot diode mis-orientation, and flag capacitor polarity errors. Your daily focus will be real-time camera feed processing. That means writing latency-aware OpenCV pipelines, integrating YOLO (or a comparable object-detection network), and pushing frames through a decision layer that raises on-screen alerts and overlays bounding boxes when something is off. You’ll also help with camera calibration, dataset curation and model training so the system improves as new boards come down the line. Deliverables I’ll review for acceptance: • A Python ...

    $575 Average bid
    15 bids

    ...medium-sized retail chains curb shrinkage without demanding a heavy upfront investment. The first priority is the core system itself: a computer-vision workflow that performs real-time monitoring, triggers instant alert notifications and feeds events into a lightweight analytics dashboard for daily and weekly loss-prevention reports. I’d like you to leverage proven open-source frameworks—think YOLO, OpenCV, TensorFlow or similar—so the software can run on inexpensive IP cameras and modest edge devices. From there, I need the entire go-to-market package wrapped around it: a clear step-by-step marketing playbook that speaks to operations directors and loss-prevention managers at mid-size chains, an onboarding guide that shows them exactly how to wire up cameras, ...

    $85 Average bid
    50 bids

    I need a complete camera solution that can sit on a fixed pole beside a 6-a-side turf and automatically keep the football centred in the frame at all times. I am counting on image-recognition tracking, not GPS or external sensors, to find the ball and drive a pair o...6-a-side scenario proving that the ball stays near dead-centre during rapid direction changes. • Setup guide so I can mount the unit on any suitable pole beside a pitch and reproduce your results. I’ll test by running a full 10-minute game: if the ball drifts out of centre for more than two seconds, or the stream drops frames, revisions will be required. Let me know which frameworks (OpenCV, YOLO, TensorRT, etc.) you plan to use; I’m happy to consider alternatives as long as they achieve smooth, r...

    $53 Average bid
    6 bids

    ...needs more data. The system should ingest standard MP4 files, and produce: Build a detection and classification pipeline using: • Roboflow + YOLO, or • Ultralytics YOLOv8/YOLO11 + MediaPipe, or • MoveNet/SensiAI + classifier • Detect: player, racket, ball, pose, shot type. • Compute timing and technical metrics. • Generate structured JSON: "type_of_shot": "bandeja", "strengths": [], "improvements": [], "score": 82, "overlay_url": "" • Generate human-like feedback using GPT-4o or similar. • Benchmark latency + cost per video. • Deliver API or script ready for integration. REQUIRED SKILLS • Computer vision (YOLO, pose es...

    $1454 Average bid
    72 bids

    ...Simple Python API (FastAPI preferred) Accepts uploaded videos Returns processed JSON + snapshot images Store data in lightweight DB (SQLite or Firebase) 3. Simple Dashboard (optional if you can do frontend) View processed data Map with pothole markers Download JSON/CSV You Don’t Need to Build a Full App Just MVP: upload → detect → output. Skills Needed Python + FastAPI Computer Vision (YOLO/Segmentation/Video processing) PyTorch / TensorFlow Basic cloud deployment (AWS/Linode/VPS) Budget ₹60,000 – ₹90,000 (fixed price) Paid in milestones. Small paid test-task will be required (detect 5 potholes in sample video). What to Include in Your Proposal Past work in computer vision (especially object detection) Model you plan to use Delivery timel...

    $1145 Average bid
    47 bids

    I have ...can feed straight into my computer-vision pipeline for detection and training. I do not need facial tagging, scene classification, polygons, or key-point work—only clear, tight bounding boxes around the objects themselves. You may use tools such as LabelImg, CVAT, VoTT, or any comparable platform as long as the final export remains consistent and easy to ingest (Pascal VOC XML, COCO JSON, or YOLO text are all acceptable). Deliverables: • Fully annotated image set with bounding boxes for every object • Corresponding annotation files in the agreed-upon format • A short summary noting the tool used and any edge cases flagged Accuracy and consistency trump speed; a small sample will be reviewed first to confirm the standard before you proce...

    $268 Average bid
    14 bids

    I have a small but urgent computer-vision task. My goal is to detect red and green traffic lights on a Raspberry Pi robot, and I need a fully working YOLO model fast. I began with YOLOv11 in Google Colab, yet training fails because many label files are missing or incomplete. Images and TXT files are out of sync, and the folder hierarchy is a mess. What I need from you: • Re-organise the dataset into the standard YOLO structure (images/train, labels/train, etc.), rename files so every image has the correct label, and fix any annotation gaps. • Train a model in Colab—feel free to use YOLOv5, v8, or stay with v11, whichever gives the best result quickly. • Hand over (or equivalent), a Colab notebook that runs end-to-end, and the cleaned dataset zip. ...

    $128 Average bid
    50 bids

    ...(if available) 6. License / DRE Number (if available) 7. Website or Profile Link (Zillow, etc.) 8. City and County Geographic Coverage Required: Northern California / Bay Area Region Please include ALL realtors in the following counties: • Alameda • Contra Costa • Marin • Napa • San Francisco • San Mateo • Santa Clara • Solano • Sonoma • Sacramento • Yolo • Placer • San Joaquin • Santa Cruz This covers the region from Sacramento down through the East Bay, Peninsula, Silicon Valley, and out toward Tracy and Stockton. Southern California Region Please include ALL realtors in these counties: • Los Angeles • Orange • San Diego • Riverside • San Be...

    $114 Average bid
    33 bids