
Closed
Posted
Paid on delivery
I need a production-ready wake-word model that reliably responds to “Hey Bobo” on both ESP32 and Raspberry Pi boards. Accuracy is critical: in a moderately noisy room it should trigger every time someone clearly says the phrase, yet ignore most unrelated speech. I still want slight phonetic wiggle-room—utterances such as “hello bo,” “hey bebo,” or “hello bebo” should wake the system as well—so fine-tuned threshold settings and a well-balanced dataset will be essential. What I expect from you • A trained wake-word model or firmware that runs in real time on ESP-IDF for ESP32 and on Raspbian/Ubuntu for Raspberry Pi without requiring cloud calls. • Demo code that shows how to load the model, stream audio from an onboard mic, and raise an event when the wake word (or approved variants) is detected. • Instructions for collecting additional audio samples so I can keep refining the model if needed, plus clear guidelines on adjusting sensitivity to maintain high precision while preventing false positives. • CPU- and memory-usage figures on each platform so I know exactly how lightweight the solution is. Acceptance criteria 1. Latency from spoken phrase to callback ≤ 250 ms on both targets. 2. ≥ 95 % wake-up rate in moderate background noise (office chatter, music at low volume). 3. ≤ 2 false activations per hour of continuous speech. 4. Complete build steps and source so I can reproduce the binary from scratch. You are free to use tools like TensorFlow Lite Micro, Porcupine, Picovoice DIY, or a custom DSP pipeline—as long as licensing permits commercial use. I am happy to provide extra voice recordings to help you fine-tune. If you’ve shipped similar models before, let me know; a quick video demo on actual hardware will fast-track selection. data collection and everything should be done solely by the freelanacer's job.
Project ID: 40282183
8 proposals
Remote project
Active 6 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
8 freelancers are bidding on average ₹19,469 INR for this job

Hi there, I am an ML engineer. I can start right away and deliver within the deadline. So, Let’s have a quick conversation. I can be more specific once we get all the requirements and information required to execute the project. Thank you!!
₹25,000 INR in 7 days
5.6
5.6

Hey, I liked your project, Precise “Hey Bobo” Wake-Word and believe I can help you with the project. With my background in Python, Linux, Electronics, I'm confident I can meet your requirements. Would be glad to go over specifics if you're interested.
₹12,500 INR in 7 days
3.8
3.8

You’re looking to develop a production-ready wake-word model that reliably recognizes “Hey Bobo” and its phonetic variants on both ESP32 and Raspberry Pi, with strict accuracy and latency requirements in moderately noisy environments. You need a fully offline solution running on ESP-IDF and Raspbian/Ubuntu, complete with demo code, performance metrics, and clear instructions for ongoing data collection and sensitivity tuning. With over 15 years of experience and 200+ projects completed, I specialize in Python, Linux, embedded systems, audio processing, and machine learning—skills that align perfectly with your need for a lightweight, real-time wake-word detection system on constrained hardware. I have previously delivered similar embedded audio models optimized for low latency and minimal false positives. I will start by designing a balanced dataset capturing your target phrase and approved variants, then train and optimize a TensorFlow Lite Micro or Porcupine-based model tailored to the ESP32 and Raspberry Pi platforms. The deliverables will include fully tested firmware, demo applications for streaming audio and event signaling, detailed resource usage reports, and clear documentation to enable you to refine the model further. I expect to complete this within 4–5 weeks. Let’s discuss your exact requirements and how I can help bring this wake-word system to life.
₹13,750 INR in 7 days
2.6
2.6

Drawing from my expertise in Electrical Engineering, Electronics and Python programming, I propose an outstanding solution to one of your critical needs: a reliable wake-word model that performs at optimal levels on both the ESP32 and Raspberry Pi boards. In particular, I am skilled in the use of tools like TensorFlow Lite Micro, Porcupine and Picovoice DIY which would be highly beneficial to this project. Importantly, I have experience with delivering edge AI solutions with little or no reliance on cloud services; this means I can provide you with a wake-word model that runs on ESP-IDF for ESP32 and Raspbian/Ubuntu for Raspberry Pi without the need for cloud calls, simultaneously ensuring low latency. In addition to meeting all your requirements for this project, my approach would be comprehensive. Alongside delivering an optimized and efficient wake-word model that boasts a ≥ 95 % wake-up rate in moderate background noise, with less than 2 false activations per hour, my deliverables will also include clear instructions on collecting additional audio samples. You will be empowered to continue fine-tuning the model if needed while maintaining high precision.
₹12,500 INR in 6 days
0.0
0.0

Hello, I can build a production-ready wake-word detection system for “Hey Bobo” that runs locally on both ESP32 and Raspberry Pi without any cloud dependency. I have extensive experience with embedded Linux systems, audio pipelines, and lightweight AI models for edge devices. My approach focuses on reliability, low latency, and efficient CPU/memory usage. Technical approach I will implement a custom wake-word detection pipeline using a lightweight neural network optimized for edge devices. The system will include: • Real-time audio capture from microphone • DSP feature extraction (MFCC) • Wake-word classification model • Event callback when the phrase is detected The solution will run entirely offline and will be optimized for both Raspberry Pi and ESP32 environments. Key features • Wake-word detection for “Hey Bobo” with phonetic tolerance (hello bo, hey bebo, etc.) • Low latency response (<250 ms) • High accuracy in moderately noisy environments • Low CPU and memory footprint suitable for embedded hardware • Fully reproducible build and source code
₹25,000 INR in 7 days
0.0
0.0

Hi, I went through your requirements and it fits very nicely with my skill set, matter of fact I have this very similar project that meets your goals. We can discuss further, I can show you what I have and we can work forward on completing this very soon!!
₹20,000 INR in 4 days
0.0
0.0

India
Member since Mar 7, 2026
$10-30 USD
$40-50 USD
$42 USD
$10-30 USD
$30-250 AUD
$3-4 USD / hour
£20-250 GBP
$10-30 AUD
$2-8 USD / hour
$10-30 USD
₹1500-12500 INR
$30-250 USD
$10-30 USD
$30-250 CAD
₹600-1500 INR
£20-250 GBP
$2-8 USD / hour
$10-30 USD
$30-250 USD
₹12500-37500 INR