If you take a modern C++ or Rust compiler and think about optimizing it…
There’s lots of “low-hanging fruit” in the form of incrementality and parallelism.

Not truly “low-hanging” in the sense of being easy to implement; it’s actually extremely hard. But it’s easy to theorize about: it’s known to be *possible*, and capable of massive speedups in various cases.

But what about the rest? How much room is there for large speedups just by optimizing algorithms? To me that feels like much more of an unknown.

@comex I have this idea that compilers represent data incorrectly for modern platforms. Currently we have a vast ocean of tiny nodes full of pointers, and we just follow pointers all day. What we need are regular, tabular internal representations that we can throw onto tensor cores.

@regehr @comex @shriramk Not even for “tensor cores”! There are a couple of orders of magnitude available on the CPU this way.
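
To make the “regular, tabular internal representations” idea concrete, here is a minimal Rust sketch (type and field names are illustrative, not from any real compiler): instead of heap-allocated IR nodes linked by pointers, nodes live in one flat struct-of-arrays table and refer to each other by 32-bit indices, which shrinks the working set and keeps passes cache-friendly.

```rust
// Minimal sketch of a flat, struct-of-arrays IR (names are illustrative,
// not taken from any real compiler). Nodes live in one densely packed
// table and refer to each other by 32-bit indices instead of pointers.

/// Index into the expression table: 4 bytes instead of an 8-byte pointer.
#[derive(Copy, Clone, Debug)]
struct ExprId(u32);

/// Struct-of-arrays layout: each field gets its own contiguous column,
/// so a pass that only inspects opcodes scans just the `op` column,
/// which the CPU can prefetch and vectorize.
#[derive(Default)]
struct ExprTable {
    op: Vec<u8>,      // opcode tag
    lhs: Vec<ExprId>, // left operand
    rhs: Vec<ExprId>, // right operand
}

impl ExprTable {
    fn push(&mut self, op: u8, lhs: ExprId, rhs: ExprId) -> ExprId {
        let id = ExprId(self.op.len() as u32);
        self.op.push(op);
        self.lhs.push(lhs);
        self.rhs.push(rhs);
        id
    }
}

fn main() {
    let mut exprs = ExprTable::default();
    let a = exprs.push(0, ExprId(0), ExprId(0)); // leaf (operands unused)
    let b = exprs.push(0, ExprId(0), ExprId(0));
    let sum = exprs.push(1, a, b);               // "a + b", referenced by index
    println!("built {:?} from {} rows", sum, exprs.op.len());
}
```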


@regehr @steve @comex @shriramk Even where you need tree data structures, there are plenty of use cases where you can improve coherence and reduce the working set by storing the levels of the tree densely and calculating offsets into the next row.
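
A rough sketch of that level-major layout, with hypothetical field names: each level of the tree is one dense array, and a node records the offset of its first child in the next level, so visiting children touches a contiguous slice instead of chasing per-child pointers.

```rust
// Minimal sketch of a level-major tree layout (field names are hypothetical):
// each tree level is one dense array, and every node stores the offset of its
// first child in the next level instead of per-child pointers.

struct Level {
    keys: Vec<u32>,        // node payloads for this level, stored contiguously
    first_child: Vec<u32>, // index of each node's first child in the next level
    child_count: Vec<u8>,  // number of children; children form a contiguous run
}

struct LeveledTree {
    levels: Vec<Level>,
}

impl LeveledTree {
    /// The children of node `i` on level `depth` are a contiguous,
    /// prefetch-friendly slice of the next level's array.
    fn children(&self, depth: usize, i: usize) -> &[u32] {
        let lvl = &self.levels[depth];
        let start = lvl.first_child[i] as usize;
        let end = start + lvl.child_count[i] as usize;
        &self.levels[depth + 1].keys[start..end]
    }
}

fn main() {
    // Root with key 1 has two children (10, 20) stored densely on level 1.
    let tree = LeveledTree {
        levels: vec![
            Level { keys: vec![1], first_child: vec![0], child_count: vec![2] },
            Level { keys: vec![10, 20], first_child: vec![], child_count: vec![] },
        ],
    };
    println!("children of root: {:?}", tree.children(0, 0));
}
```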
