3 comments

  • bdz35 an hour ago

    We are still in the brute force era. Lot of unnecessary compute is happening and that will keep reducing. But the bigger headache with cheaper and smarter is people start running faster and faster in different directions. So cost and complexity moves to keeping things in sync the larger the team gets.

  • PaulHoule an hour ago

    Yes

  • verdverm an hour ago

    It's already happening, see qwen/gemma sized models in the sub 36B category