Artificial Intelligence

Out of context: Reply #1333

Started
Last post
1,374 Responses

prophetone1
GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.
prophetone 1Permalink
Upvote Downvote
Flag
- F hell!
  
  https://twitter.com/…grafician
- Bye google, meta etc.grafician
- Can't wait for this in iOS 18-19grafician
- They already using the word "magic" in everything...grafician
- God, it's bad enough that people walk around with their speaker phone on. It's gonna be dreadful hearing people talking to nothing everywhere.formed
- It's pretty glitchy and dude immediately moves on to cover it upShenanigansTV
- When things aren't that good, people move through them as fast as possible.ShenanigansTV
Show [[ numHiddenNotes ]] more notes Add Note
Save Cancel

View thread