• technomad
    22 hours ago

    Also the enormous amount of power/energy they constantly consume.

        • @jsomae@lemmy.ml
          11 hours ago

          I don’t have a source for that, but the most power any locally-run program can draw is basically the sum of a few things: maxed-out GPU usage, maxed-out CPU usage, and maxed-out disk access. The GPU is by far the most power-hungry of these, and modern video games already push the GPU about as hard as they can get away with.

          Running an LLM locally can at most max out the GPU, putting it in the same ballpark as a video game. Typical LLM usage is a few seconds of generation per query, so it isn’t running 100% of the time, unlike a game (which stays open and active the whole session; GPU usage only dips when you’re in a menu, for instance).
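          To put rough numbers on that, here’s a quick Python sketch; every wattage and duration in it is an assumption for a back-of-the-envelope comparison, not a measurement:

          # Back-of-the-envelope energy comparison; all figures below are assumptions.
          GPU_DRAW_WATTS = 350        # rough full-load draw of a high-end consumer GPU
          LLM_SECONDS_PER_QUERY = 10  # assumed GPU-busy time for one local LLM response
          GAME_SESSION_HOURS = 1      # assumed length of a gaming session at high GPU load

          # energy (Wh) = power (W) * time (h)
          llm_wh_per_query = GPU_DRAW_WATTS * LLM_SECONDS_PER_QUERY / 3600
          game_wh_per_session = GPU_DRAW_WATTS * GAME_SESSION_HOURS

          print(f"One LLM query:      ~{llm_wh_per_query:.1f} Wh")    # ~1.0 Wh
          print(f"One hour of gaming: ~{game_wh_per_session:.0f} Wh") # ~350 Wh

          Under those assumptions a single query uses a tiny fraction of what an hour of gaming does; the difference comes almost entirely from how long the GPU actually stays busy.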

          Data centers drain lots of power by running a very large number of machines at the same time.

          • @msage@programming.dev
            4 hours ago

            From what I know, local LLMs take minutes to process a single prompt, not seconds, but I guess that depends on the use case.

            But also, I dunno about most games actually maxing out the GPU. I maxed mine out for crypto mining, and that was power hungry, so I would put LLMs closer to crypto than to games.

            Not to mention that games will entertain you way more for the same amount of time.

    • Sixty
      11 hours ago

      Curious how resource-intensive AI subtitle generation will be. Probably fine on some setups.

      Trying to use madVR (a tweaker’s video-postprocessing renderer) with an RTX 3090 in the summer turned my small office into a sauna. Next time I buy a video card it’ll deliberately be a lower tier to avoid the higher power draw lol.