FFmpeg devs boast of up to 94x performance boost after implementing handwritten AVX-512 assembly code

BrikoX@lemmy.zip · 12 hours ago

FFmpeg devs boast of up to 94x performance boost after implementing handwritten AVX-512 assembly code

Mike1576218@lemmy.ml · 10 hours ago

Hand written assembly is pretty common in video, no matter what they say. All modern video codecs have hand written assembly for all modern SIMD extensions, even on ARM. They didn’t say anything about where these numbers come from. Likely compared against unoptimized C code. There will never be a case where having AVX-512 will give you that kind of speedup, because there will be fallbacks for more common extensions.

xan1242@lemmy.dbzer0.com · 8 hours ago

It’s mostly because AVX-512 doesn’t get used too well by compilers even today.

However, what makes this impressive for me is that it is x86 after all. ARM is way easier to write assembly for.

propter_hog [any, any]@hexbear.net · 6 hours ago

Word, major respect for writing x86 assembly. That shit’s trash.

BrikoX@lemmy.zip · edit-2 11 hours ago

Sadly, Intel takes another loss here.

There is an issue, though: Intel disabled AVX-512 for its Core 12th, 13th, and 14th Generations of Core processors, leaving owners of these CPUs without them.

NateSwift@lemmy.dbzer0.com · 11 hours ago

Common Intel L recently. Shame it effects 12th gen

cmnybo@discuss.tchncs.de · 11 hours ago

I’m surprised Intel would remove a feature that AMD provides in their desktop CPUs.

deegeese@sopuli.xyz · 35 minutes ago

Probably some BS market segmentation move.

I imagine they noticed only certain server customers were using those extensions, so decided to limit them to high margin server SKUs.

It would have been a smart move if there weren’t competitors putting that instruction in every CPU.

cubism_pitta@lemmy.world · 10 hours ago

Then this is your reminder that ALL AMD CPUs are Unlocked and support overclocking…

BrikoX@lemmy.zip · 9 hours ago

True, though it’s worth noting that AMD focus efficiency means that there isn’t a lot of extra performance you can get from their modern CPUs with overclocking.

boonhet@lemm.ee · 9 hours ago

I think it’s less because of their efficiency focus and more because the chips already auto-overclock to reasonably high levels.

BrikoX@lemmy.zip · 9 hours ago

I meant more of the whole approach of designing chips with efficiency as a top priority which means they get the best performance they can within their efficiency targets, which is always more optimal than users tinkering on their own. Efficiency and performance are kind of different sides of the same coin.

shadow2@startrek.website · 9 hours ago

My reactions: Ooooh… Awwww. 😮‍💨

AOCapitulator [they/them, she/her]@hexbear.net · 10 hours ago

Hand written is impressive? I thought all code was hand written until AI

dev_null@lemmy.ml · 6 hours ago

It’s handwritten assembly as opposed to bytecode generated by a compiler, from handwritten higher level language.

BrikoX@lemmy.zip · 9 hours ago

It’s about code optimization and efficiency. Most assembly code these days just relies on compilers for optimization as hand optimizing is extremely time-consuming work.

propter_hog [any, any]@hexbear.net · 6 hours ago

To add to this, it’s also very easy and very likely to write assembly that has zero speedup or even significant slowdown versus what the compiler will write.

Cyborganism@lemmy.ca · 12 hours ago

Holy shit.