HarHarVeryFunny 1 hour ago

This seems to be a bit of a misleading headline.

In the current climate limiting someone's use of AI might be expected to be about restricting access or restricting what someone can do with it, but the story here ostensibly seems to be about capacity constraints, not any limitation on what models or capabilities Google is giving Meta access to.

  • londons_explore 53 minutes ago

    These kind of limits happen all the time for big clients.

    Cloud services like to present the illusion of an infinite amount of compute available at a fixed price per unit, but the reality is if you try to use too much of any service you'll find you have a quota and requests to increase it will fall on deaf ears if the provider doesn't have more of that resource.

    Too much of my working life has been spent shoehorning services into less space/compute/ram/spindles or migrations to other data centers to solve such issues.

symisc_devel 1 hour ago

I do believe this will be the norm from now on to get access to top frontier model. Computing capacity plus state restrictions plus KYC will be imposed to organisations to get access, individuals will be served last on the queue with degraded performance. Once the Chinese models catch up, nobody (at least individuals) will turn back again to frontier labs.

  • mden 37 minutes ago

    This seems less about frontier models and restriction and more just lack of compute capacity to meet demand. This has always been an issue for large clients running on cloud, though not to this extent.

HarHarVeryFunny 1 hour ago

It's interesting that Meta is heavily using Google's models (as opposed to Anthropic or OpenAI) given that they are not SOTA for coding. I wonder if this for some strategic/competitive reason, or maybe for cost saving?

  • sarjann 50 minutes ago

    Google tends to be very good at vision and smaller/ edge

    • HarHarVeryFunny 32 minutes ago

      Hmm ... I was assuming they were using these models for development, but I wonder if any of it might be for production instead - perhaps using vision models to analyze posted content? That would certainly be massive scale, but I'd have thought that scale would require them to be running in their own datacenters.

  • dofm 18 minutes ago

    I would imagine there are many situations within Meta's applications where relatively small models can do a good job — sentiment analysis, abusive language detection, characterising users based on their posts, summarising a user's complaint so it can be ignored more efficiently, assessing whether ads are likely to be fraudulent so they can be run more often, etc.

mark_l_watson 37 minutes ago

Misleading title on HN but an interesting article, a reminder of why the hyper scalers are investing heavily in infrastructure.

That said, I expect much of the AI bubble to pop. Google Gemini with Antigravity is a good product, as is a Claude Code subscription but I have switched to using DeepSeek v4 Pro with the Claude Code harness and DeepSeek v4 Flash with the OpenCode harness (when I am not using local models with little-coder/pi) and at least for the foreseeable future I don’t think I am going back. Fast APIs at low cost trumps having to spend a little more time to get the same quality of results.

Zambyte 1 hour ago

Facebook does seem to be falling behind. Does anyone here use Llama over more recent options for any technical reasons?