GitHub - naclonts/audio-visual-bot: A Raspberry Pi based conversational bot with visual capabilities

This repo contains the code for a conversational robot toy.

The capabilities of the robot, as of this commit, are:

Speech to text conversion of microphone audio in English to text. (Whisper)
Sending the text to an LLM and getting a conversational response. (Claude)
Converting the LLM's response to audio and playing it. (ElevenLabs)
Performing sentiment analysis on the LLM response and lighting a green or red LED for positive or negative sentiment. (DistilBERT)
Animating a small OLED display to illustrate whether the robot is currently listening, thinking, or speaking.
Based on camera input, locating any faces in the frame and moving pan/tilt servos to point at the face. (OpenCV, Haar cascade)

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
image_search		image_search
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
animations.py		animations.py
conversation.py		conversation.py
main.py		main.py
object_tracking.py		object_tracking.py
pantilthat_face_tracker_test.py		pantilthat_face_tracker_test.py
requirements.txt		requirements.txt
sentiment_led.py		sentiment_led.py

Provide feedback