LLM Experiments
Local inference experiments running Qwen 3.6 models on llama.cpp. Tweaking parameters to avoid paying a token subscription!
My Neovim setup centered around Claude Code for agent-driven development. Treats local LLMs hosted via OpenAI-compatible APIs as first-class citizens!
Self-hosted TalosOS Kubernetes platform running ~30 services across ARM and x86 nodes, deployed via GitOps (ArgoCD), with full observability, secrets management, storage, and the ability to run GPU inference workloads.
I don’t trust big tech anymore. Neither should you! This project documents how I set up and automate rsync runs to my LUKS-encrypted drives.