Hasty Briefsbeta

Bilingual

How to Build a Shitty Robot

2 days ago
  • #STEM Education
  • #DIY Robotics
  • #LLM Integration
  • A low-cost toy robot was purchased and disassembled to create an LLM-powered, voice-controlled robot for children, emphasizing simplicity and readily available materials.
  • The robot's original PCB was reverse-engineered, revealing a simple design with a single motor using gears for walking and turning; the main IC was removed to hijack the H-bridge for motor control.
  • A new chassis was built from cardboard and a paper cup, housing the electronics and a phone, which serves as the display and audio interface, controlled via an Adafruit FT232H board for GPIO.
  • A client-server architecture was implemented, with the server handling speech-to-text, text-to-speech, LLM, and agent logic locally for privacy, while the client runs in a browser using WebUSB (except Safari).
  • Speech-to-text uses Parakeet TDT 0.6B with Silero VAD for real-time transcription, and text-to-speech uses Qwen3 TTS for voice cloning, both optimized in Rust to avoid Python dependencies.
  • The pipeline supports barge-in for interruptions, low-latency responses via sentence chunking, and tools like web search, photo capture, motor control, and Spotify integration, powered by local LLMs like Gemma 4 26B.