As always with Zitron, grab a beverage before settling in.

  • jarfil@beehaw.org
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    4 days ago

    Keywords: NPU, unified RAM

    Apple is doing it, AMD is doing it, phones are doing it.

    GPUs with dedicated VRAM are an inefficient way of doing inference. They’ve been great for research purposes, into what type of NPU may be the best one, but that’s been answered already for LLMs. Current step is, achieving mass production.

    5 years sounds realistic, unless WW3.