nickandbro 27 minutes ago

Wow, if the benchmarks check out with the vibes, this could almost be like a DeepSeek moment, with Chinese AI now being neck and neck with SOTA models from US labs.

  • motoboi 17 minutes ago

    With the previous generation? Yes. With 10T Mythos-level models? Not even close.

    • bestouff 13 minutes ago

      There's no public data about Mythos.

      • maplethorpe 9 minutes ago

        That's because it would be too dangerous to release.

    • amazingamazing 10 minutes ago

      The psyop continues. Until it's released, Mythos is vaporware. Notice how you can try Kimi 2.6. Where is the same for Mythos?

    • ChrisLTD 8 minutes ago

      Mythos isn't the current generation, it's literally vaporware.

    • jollymonATX 4 minutes ago

      According to the benchmarks, you are wrong. It is on track and slightly above some SOTA. That's just the benchmarks speaking, though; they can be, and are, gamed by all the big model labs, including domestic ones.

lbreakjai 8 minutes ago

I have a subscription through work and I've been trialing it. So far it looks on par with, if not better than, Opus.

irthomasthomas 39 minutes ago

Beats Opus 4.6! They missed claiming the frontier by a few days.

  • NitpickLawyer 20 minutes ago

    While I'm skeptical of any "beats opus" claims (many were said, none turned out to be true), I still think it's insane that we can now run close-to-SotA models locally on ~100k worth of hardware, for a small team, and be 100% sure that the data stays local. Should be a no-brainer for teams that work in areas where privacy matters.

    • cedws 12 minutes ago

      Even the smaller quantized models which can run on consumer hardware pack in an almost unfathomable amount of knowledge. I don't think I expected to be able to run a 'local Google' in my lifetime before the LLM boom.

  • BoorishBears 18 minutes ago

    Opus is clearly a sidegrade meant to help Anthropic manage cost, so I would say they may have it if it actually beats 4.6

    • irthomasthomas 12 minutes ago

      Could be right. I just noticed my feed is absent the usual flood of posts demoing the new hotness on 3D modeling, game design and SVG drawings of animals on vehicles.

pt9567 6 minutes ago

Wow, $0.95 input/$4 output. If it's anywhere near Opus 4.6, that's incredible.

greenavocado 5 minutes ago

I pray the benchmark figures are true so I can stop paying Anthropic, after they screwed me over this quarter by dumbing down their models, making usage quotas ridiculously small, and demanding KYC paperwork.

wolttam 13 minutes ago

Some of these numbers are bonkers, gonna have to try it.