Taras' Blog on AI, Perf, Hacks

PSA: eget That Executable From GitHub

tldr: GitHub is where most CLI tools live. eget is a tool that makes it easy to download and install their binaries. ...

My Black and White Reading Setup

If you’re happy reading on your laptop or Apple device, you can skip this post. ...

llama.cpp: use local models with chatcraft

I needed a way to do some programming while offline. These days I feel very unproductive without https://chatcraft.org (the best chat UI for programming) and a good LLM to chat with about coding. Chatcraft needed a few small fixes to enable llama.cpp support. Here’s how to run models with llama.cpp with chatcraft.org without internet: Instructions Install and run llama.cpp. Follow https://github.com/ggerganov/llama.cpp instructions for your platform. For mac: # install llama.cpp brew install llama.cpp # start llama.cpp server & auto-download a good small (~6GB) model llama-server --hf-repo "bartowski/Llama-3-Instruct-8B-SPPO-Iter3-GGUF" --hf-file Llama-3-Instruct-8B-SPPO-Iter3-Q6_K.gguf # For more advanced usage I recommend gemma 27b: the smallest smarter-than-gpt-3.5 model (~21GB) # llama-server --hf-repo bartowski/gemma-2-27b-it-GGUF --hf-file gemma-2-27b-it-Q6_K_L.gguf Setup local chatcraft dev env by following the instructions in the chatcraft repo git clone https://github.com/tarasglek/chatcraft.org/ cd chatcraft.org pnpm install pnpm dev ^ will output a development url like http://localhost:5173/, open it. Go to chatcraft settings and add http://localhost:8080/v1 to api providers. Enter a dummy api key. Enjoy! ...

Fast groq-hosted LLMs vs browser jank

TLDR: chatcraft.org is now smooth as butter and maximally performant thanks to throttling LLM chat-completion rendering via requestAnimationFrame(). ...

Developing markdown.download: Exporing val.town, deno, jsr.io

I read a lot. I enjoy reading on black and white e-readers. Unfortunately many websites make it hard to read them on simple devices. https://markdown.download is my really simple solution to that. Prepend it to any website and curl: curl https://markdown.download/https://dev.to/amnish04/introducing-the-idea-of-web-handlers-in-chatcraft-1b4i That turns a user-hostile website into a thing that almost anything can render: It’s amazing how much cleaner the web is when stripped down to essence. Turns out markdown (or html restricted to what markdown can do) renders fantastic on readers of all types. Since markdown.download is so utterly simple, I decided to try using it to learn a new-to-me dev tool: val.town. Rest of this post covers what I learned. I initially was going to publish this project as an npm lib and deploy on cloudflare, but was putting it off because I loathe the amount of bureaucracy that involves. I love the type system in Typescript. I love Node.JS because it’s the least-bad way to write/deploy apps, but much room for improvement remains. My spare time is too valuable to spend it suffering. val.town: brutally efficient microservice dev-ux...