Artificial Intelligence

Out of context: Reply #1333

  • Started
  • Last post
  • 1,374 Responses
  • prophetone1

    GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

    • F hell!

      https://twitter.com/…
      grafician
    • Bye google, meta etc.grafician
    • Can't wait for this in iOS 18-19grafician
    • They already using the word "magic" in everything...grafician
    • God, it's bad enough that people walk around with their speaker phone on. It's gonna be dreadful hearing people talking to nothing everywhere.formed
    • It's pretty glitchy and dude immediately moves on to cover it upShenanigansTV
    • When things aren't that good, people move through them as fast as possible.ShenanigansTV

View thread