Skip to content

Latest commit

 

History

History
137 lines (94 loc) · 7.67 KB

README.md

File metadata and controls

137 lines (94 loc) · 7.67 KB

A text-to-speech chatbot demo built using Nextjs, OpenAI's GPT-3 Chat Completions API, and ElevenLabs' Text-to-Speech API.


Tip

Kindly read this comprehensive tutorial to learn how this was built and how you can build yours.


Table of Contents


Features

  • A responsive, user-friendly, and performant UI built with TypeScript, Reactjs, Nextjs App Router, Tailwind CSS, and ElevenLabs TypeScript SDK.
  • PWA support for installation on mobile devices.
  • Users can ask a question by text and get a text response.
  • Users can ask a question by text and get an audio response.
  • Users can change the voice of the audio response (40+ options).
  • Users can replay the last audio response.
  • Users can start a new chat session.
  • Chat history and voice settings are saved to local storage.
  • Public live demo for users to explore with their API keys.
  • Stream the text response from OpenAI.
  • Allow process termination when the response is streaming (refactor the current loading indicator to a stop button).
  • Pass the stream chunks from OpenAI directly to ElevenLabs.
  • Stream the audio response from ElevenLabs to the UI.
  • Increase the response tokens and characters.
  • Add an option to choose the language translation of the audio response (using the eleven _multilingual_v2 model).
  • Theme toggle for light mode.

Important Files and Folders

Path Description
.env.example Example file with all the required environment variables.
/app/chat/route.ts API route handler for communicating with OpenAI.
/app/speech/route.ts API route handler for communicating with ElevenLabs.
/app/components/storeApiKeys.tsx React component for the user API keys section of the UI.
/app/components/chatVoice.tsx React component for the voice selection section of the UI.
/app/components/chatMessages.tsx React component for the chat messages section of the UI.
/app/components/chatControls.tsx React component for the user controls section of the UI.
/app/components/chatInput.tsx React component for the user input section of the UI.
/hooks/useLocalStorage.ts React hooks to save and fetch data from localStorage.
/app/layout.tsx Shared UI for fonts and metadata configuration.
/app/page.tsx Home page (/).
/app/chat/page.tsx Chat page (/chat).
/utils/getVoices.ts Utility file to fetch voices from ElevenLabs.
/utils/notifyUser.ts Utility file for toast notifications.
/types/chat.ts Types for the entire project.

Getting Started

To run this application locally, kindly follow the steps below:

  1. Rename the .env.example file to .env.local and fill in the required environment variables.

  2. Install all required dependencies with the npm install command (or use yarn / pnpm).

  3. Run the development server with the command npm run dev.

  4. Open http://localhost:3000 with your browser to see the result.

  5. All good! You can start modifying any page and the app will auto-update.

API Keys Guide

To enable anyone to test the application in production if they want to, I've added a form where they can enter their API keys for both OpenAI and ElevenLabs (so I don't have to cover the costs and deal with request overload since this is a basic demo for now). This is a good way for anyone to test the application without having to clone and run it locally. To create both API keys (if you don't have one already), kindly read this section of the tutorial. Once you have the keys, click on the settings icon at the top section of the page, and enter them in the form provided. You can repeat the same process to update the keys.

Warning

Both API keys are stored on your local computer and are not saved to any database or sent to any server/third-party service (they're stored in localStorage and you can read the code yourself). The keys are only used to make API requests on your behalf each time you want to use the app. This is safe and secure for anyone to use if you guarantee that no one else has access to your computer or browser. You can use the "Delete" button in the form to remove the API keys from your local storage at any time.

Enjoy chatting with Siri!

Troubleshooting

If you're accessing the application on the Safari browser, you might get the NotAllowedError: The request is not allowed by the user agent or the platform in the current context, possibly because the user denied permission error. This is because the browser doesn't allow autoplaying audio by default. You can fix this by going to the browser's Auto-Play settings as seen in the screenshot below and changing the option for the chat-with-siri.vercel.app website from Stop Media with Sound to Allow All Auto-Play. If you refresh now, the audio should work fine.

Repo Stats Summary

GitHub Repository Statistics

Contributors Guide

  1. Fork this repository (learn how to do this here).

  2. Clone the forked repository like so:

git clone https://github.com/<your username>/chat-with-siri.git && cd chat-with-siri
  1. Make your changes and create a pull request (learn how to do this).

  2. I'll attend to your pull request soon and provide some feedback.

License

This repository is published under the MIT license.