For the physical hardware I use the esp32-s3-box[1]. The esphome[2] suite has firmware you can flash to make the device work with Home Assistant automatically. I currently use my own esphome profile[3], but I'm considering switching to this one[4] instead.

For the actual AI, I set up three docker containers: one for speech-to-text[5], one for text-to-speech[6], and ollama[7] for the model itself. After that it's just a matter of pointing Home Assistant at the various services, since it has built-in support for all of them.
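
Before pointing Home Assistant at the services, it can help to confirm that each container is actually listening. Here's a rough sketch of how I'd check that, not part of my actual setup. The ports are assumptions based on the usual defaults for these projects (10300 for wyoming-faster-whisper, 10200 for wyoming-piper, 11434 for ollama), and the host is hypothetical, so adjust both to match however you started your containers.

```python
import socket

# Assumed default ports; change these if your containers map them differently.
SERVICES = {
    "speech-to-text (wyoming-faster-whisper)": 10300,
    "text-to-speech (wyoming-piper)": 10200,
    "ollama": 11434,
}


def is_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


def check_services(host, services=SERVICES):
    """Map each service name to whether its port accepts connections."""
    return {name: is_reachable(host, port) for name, port in services.items()}


if __name__ == "__main__":
    host = "127.0.0.1"  # hypothetical: the machine running the containers
    for name, ok in check_services(host).items():
        print(f"{name}: {'up' if ok else 'unreachable'}")
```

This only checks that something accepts TCP connections on each port. It doesn't verify that the service behind it is the right one or that it's working properly, but it catches the most common problem of a container not running or a port not being published.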

1. https://www.adafruit.com/product/5835

2. https://esphome.io/

3. https://gist.github.com/tedivm/2217cead94cb41edb2b50792a8bea...

4. https://github.com/BigBobbas/ESP32-S3-Box3-Custom-ESPHome/

5. https://github.com/rhasspy/wyoming-faster-whisper

6. https://github.com/rhasspy/wyoming-piper

7. https://ollama.com/