AI Developmentr/LocalLLaMA

I classified 3.5M US patents with Nemotron 9B on a single RTX 5090 — then built a free search engine on top

Read original
ai-coding-toolslocal-llm-workflowsdomain-specific-aihybrid-search-architecture

Patent attorneys need exact phrase matching. 'solid-state battery electrolyte' should match those exact words, not semantically similar documents about 'energy storage.' FTS5 gives sub-second queries on 3.5M records with zero external dependencies.

Key takeaways

  • Local LLM (Nemotron 9B on RTX 5090) classified 3.5M patents in 48 hours, demonstrating consumer hardware can handle enterprise-scale classification tasks
  • Contrarian architecture choice: FTS5 full-text search outperforms vector embeddings for domain-specific use cases requiring exact phrase matching and deterministic results
  • Complete technical stack disclosed: SQLite FTS5 + local LLM query expansion + BM25 ranking with custom weights + FastAPI, hosted on Chromebook via Cloudflare Tunnel - proving production-grade search doesn't require cloud infrastructure
  • Patent lawyer with 1 month coding experience built production search engine, signaling democratization of AI tooling for domain experts without traditional engineering backgrounds
  • Hybrid approach: Uses LLM for natural language query expansion into boolean queries, then traditional search for retrieval - combining strengths of both paradigms

Why this matters for operators: Legal tech, enterprise search architecture, local LLM deployment for specialized domains, hybrid search strategies

I cover AI×GTM intelligence like this every Wednesday.

Get STEEPWORKS Weekly

More picks

Enterprise AIMIT Technology Review AI

Rebuilding the data stack for AI

  • Enterprise AI adoption is bottlenecked by fragmented, ungoverned data infrastructure rather than AI model capabilities
  • Competitive differentiation comes from proprietary data combined with third-party enrichment, not just AI tools
  • Evolution from 'system of engagement' to 'system of action' represents shift toward autonomous AI agents managing workflows
data-infrastructureenterprise-ai-readinessai-governance
Enterprise AIDemand Gen Report

Gartner: Explainable AI Will Drive LLM Observability Investments

  • LLM observability adoption will jump from 15% to 50% of GenAI deployments by 2028, driven by explainability requirements for scaling beyond low-risk use cases
  • Traditional IT observability (latency, cost) is insufficient - new metrics needed include hallucination detection, factual accuracy, logical correctness, and sycophancy measurement
  • Gartner recommends XAI tracing for high-impact use cases, multidimensional observability platforms, and continuous evaluation frameworks with human-in-the-loop validation
ai-policyregulatory-impactmarket-consolidation
AI DevelopmentLenny's Newsletter

From a $6.90 newsletter to $3M API: How a non-coder built Memelord | Jason Levin

  • Non-technical founder scaled from $6.90/month newsletter to $100K ARR using Bubble (no-code), then raised $3M to build API-first product - validates no-code as legitimate path to venture scale
  • Mandatory 'vibe-coding' rule for marketing team - employees must build their own AI tools/automations, representing shift from using AI to building with AI as core marketing skill
  • Free AI tools as lead gen replacing traditional content - 'free tools are the new PDF downloads' generated hundreds of thousands of emails, signaling evolution in PLG motion
ai-coding-toolsautomation-stacksplg-to-sales

This analysis was produced using the STEEPWORKS system — the same agents, skills, and knowledge architecture available in the GrowthOS package.