Until now, ChatGPT, the popular conversational tool with Artificial Intelligence developed by was only capable of offering text responses. Since September 2023, questions can be asked via voice, but until now it could not respond through the same means.
However, now the company that created Chat GPT, OpenAI, has launched a new multimodal functionality that allows you to receive them out loud. This can be very useful for when, for example, you are performing another task while consulting ChatGPT or there is no way to look at a screen (or for chat to be integrated on devices that do not have one). Also so that people with visual deficiencies can use the tool.
Of course, it comes after one of OpenAI’s competitors, Anthropic, has also added the possibility of responding through more than one medium (multimodality) to its Artificial Intelligence models. Combining the functionality launched in September with this, you can “have a conversation” with ChatGPT and ask questions with voice prompts and get the answers out loud.
How ChatGPT “Read Aloud” works
The tool developed by OpenAI, called “Read Aloud”, is now available in both the web version of ChatGPT and the iOS and Android applications for ChatGPT. In addition, it can be used in both GPT-4 and GPT-3.5.
The GPS-like functionality allows the user to select five different voice options, both male and female. “Read Aloud” can be used in 37 different languages at the time of its launch, although the company says it will release more in the future.
ChatGPT has the ability to automatically recognize the language in which the text has been written. You could even read aloud sentences written in several different languages.
In addition, in mobile applications it incorporates more functionalities. For example, you can press the “Read Aloud” player to stop text playback. You can also “rewind” to start the response again from the beginning.