Fast groq-hosted LLMs vs browser jank
TLDR: chatcraft.org is now smooth as butter and maximally performant thanks to throttling LLM chat-completion rendering via requestAnimationFrame(). ...
TLDR: chatcraft.org is now smooth as butter and maximally performant thanks to throttling LLM chat-completion rendering via requestAnimationFrame(). ...
13 GiB/s per core! Sneller posted a blog on HN on how they use AVX-512 to decompress data at 13 gigabytes per second per core. This a fantastic ad for their “lets turn logs on S3 into cheap database” product. This is a solution I wanted multiple times, will definitely consider them next time the need comes up. Faster than RAM Now this post did not get overlooked, but what did get overlooked is that the post engaged the clickhouse CTO. He posted a link to a presentation on how Clickhouse uses compression to process in-memory data faster than RAM bandwidth. As a result of discussion in these comments, clickhouse might get even faster. ...