(Post-WIP) AI Agents for Browser Use
- People
  - David
  - Joe
- Idea
  - Test available AI Browser Use tools - "can an LLM use my browser to order me Taco Bell?"
- Details
  - OpenAI Operator
  - Open-source: browser-use
People:
* David
Idea:
* Comparing fine-tuning performance on MacBook M3 Max, Mac Studio M1 Ultra, and Nvidia 4090 using MLX and Unsloth
Details:
* Tested fine-tuning Phi-3-mini-4k-instruct model
* Followed this Jan 2025 MLX guide for Apple hardware
* Used Unsloth library for Nvidia GPU
* Dataset had 627 examples and used 500 training steps
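When comparing runs across MLX and Unsloth, it can help to check how many passes over the data the 500 steps actually represent, since that depends on the effective batch size (batch size times any gradient accumulation). The notes do not state the batch size, so the values tried below are assumptions, and `epochs_covered` is a hypothetical helper written for illustration only.

```python
def epochs_covered(num_examples: int, steps: int, effective_batch_size: int) -> float:
    """Return how many passes over the dataset a training run makes.

    effective_batch_size is an assumption here: the notes give only the
    dataset size (627 examples) and the step count (500).
    """
    if num_examples <= 0 or steps < 0 or effective_batch_size <= 0:
        raise ValueError("num_examples and effective_batch_size must be positive, steps non-negative")
    return steps * effective_batch_size / num_examples


if __name__ == "__main__":
    # Batch sizes below are illustrative guesses, not values from the notes.
    for batch_size in (1, 2, 4):
        epochs = epochs_covered(num_examples=627, steps=500, effective_batch_size=batch_size)
        print(f"effective batch size {batch_size}: {epochs:.2f} epochs")
```

If the two toolchains use different default batch sizes, the same 500 steps cover different amounts of data, which is worth ruling out before comparing wall-clock times.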
People:
* David
Idea:
* Exploring cloud GPU performance, cost, and usability for running open-source AI models (in comparison to local hardware, and in the context of recent learning about software for handling concurrent users)
Details:
* Compared RunPod serverless and pods using Nvidia (vLLM) and AMD (sglang)
* Benchmarked Nvidia 3090, 4090, RTX 6000
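One way to put performance and cost on the same scale is to convert an hourly GPU price and a measured throughput into a cost per million generated tokens. The sketch below is a minimal illustration: `cost_per_million_tokens` is a hypothetical helper, and the price and throughput figures are placeholders, not RunPod prices or results from these benchmarks.

```python
def cost_per_million_tokens(hourly_price_usd: float, tokens_per_second: float) -> float:
    """Convert an hourly rental price and sustained throughput into USD per 1M tokens."""
    if hourly_price_usd < 0 or tokens_per_second <= 0:
        raise ValueError("price must be non-negative and throughput must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return hourly_price_usd / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    # Placeholder inputs for illustration only; substitute measured values.
    hypothetical_runs = {
        "gpu_a": {"hourly_price_usd": 0.50, "tokens_per_second": 1000.0},
        "gpu_b": {"hourly_price_usd": 1.00, "tokens_per_second": 2500.0},
    }
    for name, run in hypothetical_runs.items():
        cost = cost_per_million_tokens(**run)
        print(f"{name}: ${cost:.3f} per 1M tokens")
```

Throughput here should be the aggregate rate across concurrent requests, since a server that batches requests can deliver far more total tokens per second than a single stream suggests.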
People:
* David
Idea:
* Comparing two open-source projects, Onyx and Morphik, for self-hosted semantic search (and deep research) across Google Drive files (PDFs, Docs, Sheets, and Slides)
Details:
* Onyx: Great text-based search, flexible embedding models
* Morphik: Powerful multi-modal search (text, images, graphs)
* Onyx easily connects and syncs with Google Drive
* Morphik
A Real-World Test of Prompt-to-UI AI
Creating a usable interface for a brand-new app often eats up early project time. To see how far AI can shorten that stage, we pointed Magicpatterns.com at a brand idea called Po‘owai, a cash-flow tool for non-profits that have to juggle restricted