(Post-WIP) Comparing LLM inference engines (multi-user and multi-model)
* People
  * David
* Idea
  * For a given set of local LLM models that a set of users (or community) wants to use: what is the best/easiest way to serve them (based on ease of use, speed of processing and generation, and features like multi-context and
* Details
  * Engines like llama.