Most Viewed Content:

Cygnus space cargo spacecraft arrives at International Space Station with only half of its solar array

NASA astronaut Nicole Mann, backed up by NASA astronaut...

Toyota responds to continued production cuts in the next 3 months: easing pressure on dealer earnings

In response to the news that "production will continue...

Google to bring PWA application backup & restore function for Chrome/android

According to thespAndroid reports, GitHub's Chromium repository recently added...

OpenAI Updates ChatGPT: Support for Image and Voice Inputs

Recently OpenAI announced the launch of a new version of ChatGPT, adding two new features: voice input and image input. According to OpenAI, the new features will be rolled out to ChatGPT Plus subscribers in the next two weeks, and others will be able to use these features “soon”.

The voice input feature is similar to a voice assistant on a cell phone, in that the user simply presses a button, says their question, and ChatGPT converts it to text, generates the answer, and then converts the answer to speech and plays it back to the user. openAI says this is a much more natural and convenient way of interacting with the user, and because of LLM’s technology, the answers will be of higher quality. openAI has also developed a new text-to-speech feature, which will be available to ChatGPT Plus subscribers “very soon”. OpenAI has also developed a new text-to-speech model that generates human voices based on a few seconds of sample speech. Users can choose from five options for ChatGPT’s voice, and there are more potential uses for this model. For example, OpenAI is working with Spotify to translate podcasts into other languages while preserving the voice of the podcast host. However, there are some risks associated with the model, such as the possibility that it could be used maliciously to impersonate public figures or commit fraud. As a result, OpenAI says the model will not be widely open, but will be strictly controlled and limited.

The image input function is similar to Google Lens, allowing users to take pictures of things they are interested in and upload them to ChatGPT, which tries to recognize what the user wants to ask and give them the appropriate answer. Users can also use the app’s drawing tools to help express their questions or communicate with voice or text input, and ChatGPT has the advantage of being able to have multiple conversations rather than a one-time search. If the user is not satisfied with the answer or wants more information, they can continue to ask ChatGPT questions to get a more accurate and comprehensive answer. Of course, there are some potential problems with image search. For example, when dealing with images of people, OpenAI says they have limited ChatGPT’s ability to analyze and directly evaluate people, both to ensure accuracy and to protect privacy, meaning that uploading a person’s photo to know who he/she is is not yet possible.

It is noted that since launching ChatGPT in early 2022, OpenAI has been working hard to add more features and capabilities to its bot while avoiding causing new problems to arise. With this update, the company is trying to find a balance on that line, by consciously limiting what its new models can do. But this approach isn’t a long-term solution, and as more and more people use voice control and image search, and as ChatGPT evolves into a truly multimodal and useful virtual assistant, it’s going to become increasingly difficult to maintain safe and sensible boundaries.

Latest

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Newsletter

Don't miss

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Samsung Galaxy S25 Ultra expected to feature 5000mAh + 45W Combo

Technology media WccFtech recently reported that Samsung will not...
Stephen Cruise
Stephen Cruisehttps://www.techgoing.com
Stephen Cruise is a senior editor covering latest smartphones, EVs, PC gaming, console, and tech with 11 years of experience.

Honda Plans Electric Vehicle Supply Chain Project in Canada: 240K Annual Capacity

Honda recently announced plans to build an electric vehicle supply chain complex in Ontario, Canada, with the goal of starting production in 2028. After...

Lei Jun: Xiaomi SU7’s 10,000th vehicle officially rolls off the production line

Lei Jun said on social platforms: "32 days after the release of Xiaomi SU7, our 10,000th mass-produced vehicle officially rolled off the production line."...

2024 Beijing Auto Show: Mid-term facelift BMW 4 Series launched

The 2024 Beijing Auto Show officially kicked off. The BMW brand brought the new 4 Series to consumers. A total of 4 configurations are...