For the physical hardware I use the esp32-s3-box[1]. The esphome[2] suite has firmware you can flash to make the device work with Home Assistant automatically. I currently use my own esphome profile[3], but I'm considering switching to this[4] profile instead.
For the AI side, I run three docker containers: one for speech to text[5], one for text to speech[6], and ollama[7] for the model itself. After that it's just a matter of pointing Home Assistant at each service, since it has built-in support for all of them.
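Before pointing Home Assistant at the three services, it can help to confirm each container is actually listening. Here's a rough sketch of that check, not part of my real setup. The host and port numbers are assumptions: 10300 and 10200 are the ports commonly used in the wyoming-faster-whisper and wyoming-piper examples, and 11434 is ollama's usual default. Change them to whatever your containers expose.

```python
import socket

# Assumed hosts and ports; adjust these to match your own container setup.
SERVICES = {
    "speech to text (wyoming-faster-whisper)": ("localhost", 10300),
    "text to speech (wyoming-piper)": ("localhost", 10200),
    "ollama": ("localhost", 11434),
}


def is_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False


def check_services(services=SERVICES):
    """Map each service name to whether its port accepts connections."""
    return {
        name: is_reachable(host, port)
        for name, (host, port) in services.items()
    }


if __name__ == "__main__":
    for name, ok in check_services().items():
        print(f"{name}: {'reachable' if ok else 'NOT reachable'}")
```

This only tests that the port accepts a TCP connection. It doesn't verify that the service behind it is healthy or speaks the protocol Home Assistant expects.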
1. https://www.adafruit.com/product/5835
2. https://esphome.io/
3. https://gist.github.com/tedivm/2217cead94cb41edb2b50792a8bea...
4. https://github.com/BigBobbas/ESP32-S3-Box3-Custom-ESPHome/
5. https://github.com/rhasspy/wyoming-faster-whisper
6. https://github.com/rhasspy/wyoming-piper
7. https://ollama.com/