Osmosis Tech Stack
Published: 7/3/25

We grab the signal before it vanishes.

Real-time search for the unindexed web, verifiable and traceable to the source.

While legacy vendors index what's already public, Osmosis captures what's about to disappear—and delivers it while you can still act on it.

Identify
Capture
Understand
Alert

Complete Tech Stack Overview

📊 Data Sources

Pension Funds

PE Firms

Russell 3000

Podcasts & Media

🔄 Ingestion Pipeline

Discovery: Firecrawl, Jina.ai, PodcastIndex API
Capture: Meeting Bots, GCP Storage
Process: Deepgram/Whisper, Google Gemini
Analyze: GPT-4o, o3, BERT NER

🧠 Intelligence Layer

Knowledge Graph: PostgreSQL → Neo4j

Search: Elasticsearch

Embeddings: OpenAI + Elastic

Continuous Learning

📱 Delivery

Web App: Next.js/React

API: Python/Flask

Alerts: AI + AWS SES

☁️ Infrastructure

GCP Cloud Run • DigitalOcean → GKE • Vercel • Celery Orchestration

1. Dynamic Knowledge Graph

Living Map That Grows

  • Every fund, allocator, exec, and recurring meeting
  • Dynamically adds new orgs/speakers as encountered
  • Enriches via org websites & conversation channels
  • Target: 5,000+ allocators by Sept 2025
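A minimal sketch of how a graph might add new organizations and speakers as they are encountered. The `KnowledgeGraph` class, its method names, and the `WORKS_AT` / `SPOKE_AT` relation labels are hypothetical, the sample names are made up, and the in-memory dictionaries only stand in for the PostgreSQL/Neo4j store named in this deck.

```python
from dataclasses import dataclass, field


@dataclass
class KnowledgeGraph:
    """Toy in-memory graph (assumption: stands in for a real graph store)."""
    nodes: dict = field(default_factory=dict)  # node_id -> {"kind", "name"}
    edges: set = field(default_factory=set)    # (source_id, relation, target_id)

    def upsert_node(self, kind: str, name: str) -> str:
        """Return the node id, creating the node only if it is new."""
        node_id = f"{kind}:{name.strip().lower()}"
        self.nodes.setdefault(node_id, {"kind": kind, "name": name.strip()})
        return node_id

    def link(self, source_id: str, relation: str, target_id: str) -> None:
        """Add a directed edge; duplicates collapse because edges is a set."""
        self.edges.add((source_id, relation, target_id))

    def observe_speaker(self, person: str, org: str, meeting: str) -> str:
        """Record that `person` from `org` spoke at `meeting`."""
        p = self.upsert_node("person", person)
        o = self.upsert_node("org", org)
        m = self.upsert_node("meeting", meeting)
        self.link(p, "WORKS_AT", o)
        self.link(p, "SPOKE_AT", m)
        return p


kg = KnowledgeGraph()
kg.observe_speaker("Jane Doe", "Example Pension Fund", "Board Meeting 2025-06")
# Same person and org seen again at a new meeting: only the meeting is added.
kg.observe_speaker("jane doe ", "Example Pension Fund", "Board Meeting 2025-09")
print(len(kg.nodes), len(kg.edges))
```

Keying nodes on a normalized name keeps repeat sightings from creating duplicates. A production system would likely need stronger entity resolution than lowercasing.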

Intelligent Orchestration

The graph tells us who matters, where they meet, and when they speak

Dispatches agentic crawlers to "ghost-attend" thousands of live events

PostgreSQL → Neo4j • OpenAI Responses API • Diffbot API • Wikidata API
🧠

Self-improving system

Not just seeding—continuously discovering


2. Capturing Millions of Events

⚡ 2.6M+ Public Events Annually

When any of these disappear, we've already archived them. Competitors can't backfill what's gone.

Government & Regulatory

1.5-2M

Local meetings alone

Earnings Calls

50K+

Global public companies

Finance Content

35K+

Podcasts, webinars, conferences

Agentic Infrastructure

Ghost-attend via Zoom, livestreams, telephony

Redundancy across sources • Advanced de-duplication
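One way to picture de-duplication across redundant sources is to collapse captures that share a normalized event key. This is only a sketch: the field names (`org`, `title`, `day`, `source`) and the key choice are assumptions for illustration, not the deck's actual method.

```python
import re
from datetime import date


def event_key(org: str, title: str, day: date) -> tuple:
    """Normalize fields so the same event from different sources collides."""
    def norm(text: str) -> str:
        # Lowercase and squash punctuation/whitespace runs into single spaces.
        return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()
    return (norm(org), norm(title), day.isoformat())


def deduplicate(captures: list) -> list:
    """Keep the first capture per event and remember every source that saw it."""
    merged = {}
    for cap in captures:
        key = event_key(cap["org"], cap["title"], cap["day"])
        if key not in merged:
            merged[key] = {**cap, "sources": [cap["source"]]}
        else:
            merged[key]["sources"].append(cap["source"])
    return list(merged.values())


captures = [
    {"org": "City of Example", "title": "Pension Board Meeting",
     "day": date(2025, 6, 1), "source": "zoom"},
    {"org": "City of Example ", "title": "Pension Board -- Meeting",
     "day": date(2025, 6, 1), "source": "livestream"},
]
events = deduplicate(captures)
print(len(events), events[0]["sources"])
```

Keeping the list of sources per event preserves the redundancy: if one recording fails, another capture of the same event is still on record.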

Enrichment Channels

Monitor all channels: websites, docs, conversations

Firecrawl • Jina.ai • GCP Cloud Run

3. Intelligent Content Capture

Live Events

Automated meeting bots join and capture in real-time

Custom Twilio Agent • Fireflies • Recall.ai • Custom Zoom Agent

Media & Documents

Podcasts, videos, and documents archived with metadata

GCP Cloud Storage • Metadata Extraction
🎯

Real-time Capture

Zero content missed


4. Audio-Specific NLP Pipeline

🎙️ Built for Spoken Chaos

1. Diarization

Who's speaking when

2. Speaker ID via Graph

Nate Weinstein at Osmosis, not "analyst"

This step alone is hard, and impossible without the knowledge graph (KG)

3. Delta Detection

What changed since last quarter
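To make step 2 concrete, here is a rough sketch of turning anonymous diarization labels into named speakers using a roster drawn from the graph. The function name, the `SPEAKER_0`-style labels, the roster shape, and the self-introduction heuristic are all assumptions; a real system would likely combine several signals.

```python
# Hypothetical phrases that signal a speaker introducing themselves.
INTRO_PREFIXES = ("this is ", "i'm ", "i am ", "my name is ")


def identify_speakers(segments: list, roster: dict) -> list:
    """Map diarization labels to known people via self-introductions.

    segments: (label, text) pairs, as a diarizer might emit them.
    roster:   full name -> organization, e.g. expected attendees from the graph.
    """
    resolved = {}
    for label, text in segments:
        if label in resolved:
            continue
        lowered = text.lower()
        for name, org in roster.items():
            if any(prefix + name.lower() in lowered for prefix in INTRO_PREFIXES):
                resolved[label] = f"{name} at {org}"
                break
    # Second pass: relabel every segment, including ones before/after the intro.
    return [(resolved.get(label, label), text) for label, text in segments]


segments = [
    ("SPEAKER_0", "Hi, this is Nate Weinstein. Thanks for having me."),
    ("SPEAKER_1", "Welcome."),
    ("SPEAKER_0", "Our allocation changed this quarter."),
]
roster = {"Nate Weinstein": "Osmosis"}
labeled = identify_speakers(segments, roster)
print(labeled[2][0])
```

Without the roster there is nothing to match the introduction against, which is why this step depends on the graph.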

📄 Multi-Modal Documents

First-Class Citizens

Board books, meeting minutes, PDFs

Intelligent Narration

Google Gemini for deeper context

Small, deterministic models → Fast, cheap, and resilient


5. Pull Out the "So What"

Progressive Intelligence

LLMs flag mandates, sentiment shifts, and changes

OpenAI GPT-4o

See change, not noise

Know Who's Talking

Tag every quote to the right allocator, fund, or regulator

BERT-Large NER • Knowledge Graph Linking

Delta Analysis (NEW)

Track narrative shifts across time

You can't replay a meeting that no longer exists

OpenAI o3 • FAISS • Historical Archive

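A simplified sketch of delta analysis: compare what was said on each topic in two periods and flag what is new, dropped, or changed. This swaps in plain lexical similarity from Python's `difflib` in place of the embedding and LLM tooling listed above. The function name, the topic-to-statement input shape, and the 0.8 threshold are assumptions.

```python
from difflib import SequenceMatcher


def detect_deltas(previous: dict, current: dict, threshold: float = 0.8) -> dict:
    """Return topic -> "new" | "dropped" | "changed" between two periods.

    previous/current: topic -> statement text (hypothetical input shape).
    Topics whose statements are at least `threshold` similar are omitted.
    """
    deltas = {}
    for topic in sorted(set(previous) | set(current)):
        if topic not in previous:
            deltas[topic] = "new"
        elif topic not in current:
            deltas[topic] = "dropped"
        else:
            ratio = SequenceMatcher(
                None, previous[topic].lower(), current[topic].lower()
            ).ratio()
            if ratio < threshold:
                deltas[topic] = "changed"
    return deltas


last_quarter = {
    "private credit": "Holding allocation steady.",
    "real estate": "No changes planned.",
}
this_quarter = {
    "private credit": "We plan to double our commitment next year.",
    "real estate": "No changes planned.",
    "infrastructure": "Starting a manager search.",
}
deltas = detect_deltas(last_quarter, this_quarter)
print(deltas)
```

The comparison only works if the earlier statement was archived, which is the point the slide makes about meetings that no longer exist.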
"Real-time intelligence, tailored to you."
Not another inbox dump

Every click and comment feeds back into the ranking engine, so the next alert is even sharper


6. Compound Defensibility Engine

🔄 Feedback Flywheel

Every click sharpens ranking → more usage → sharper ranking → compound moat
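The flywheel can be illustrated with a small click-feedback loop: topics a user clicks drift toward higher weight, ignored topics drift lower, and the weights order the next batch of alerts. Everything here is a hypothetical sketch: the function names, the 0.5 default weight, and the 0.2 learning rate are assumptions, not the deck's ranking engine.

```python
def update_weights(weights: dict, shown: list, clicked: set,
                   rate: float = 0.2) -> dict:
    """Nudge each shown topic toward 1.0 if clicked, toward 0.0 if ignored."""
    updated = dict(weights)
    for topic in shown:
        target = 1.0 if topic in clicked else 0.0
        old = updated.get(topic, 0.5)  # unseen topics start neutral
        updated[topic] = old + rate * (target - old)
    return updated


def rank_alerts(alerts: list, weights: dict) -> list:
    """Order candidate alerts by the learned weight of their topic."""
    return sorted(alerts, key=lambda a: -weights.get(a["topic"], 0.5))


weights = update_weights({}, shown=["mandates", "staffing"], clicked={"mandates"})
alerts = [{"id": 1, "topic": "staffing"}, {"id": 2, "topic": "mandates"}]
ranked = rank_alerts(alerts, weights)
print([a["id"] for a in ranked])
```

Each round of feedback moves the weights a little further, so the ordering reflects accumulated usage and not just a single click.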

📚

Historical Archive

Grows daily—others start at zero

Can't backfill vanished data

🔍

Lightning Search

Semantic + traditional retrieval

Elasticsearch • OpenAI Embeddings
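One common way to combine semantic and traditional retrieval is reciprocal rank fusion (RRF), which merges ranked lists without needing comparable scores. The sketch below is an illustration, not a statement of how Osmosis fuses results; the document ids and the constant `k = 60` are assumptions.

```python
def reciprocal_rank_fusion(rankings: list, k: int = 60) -> list:
    """Fuse ranked lists of document ids.

    Each document scores the sum of 1 / (k + rank) over the lists it
    appears in, with ranks starting at 1. Ties break on document id.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


keyword_hits = ["a", "b", "c"]    # e.g. from a keyword (BM25-style) query
semantic_hits = ["b", "d", "a"]   # e.g. from an embedding similarity query
fused = reciprocal_rank_fusion([keyword_hits, semantic_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, while a document found by only one method still makes the final list.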
🎯

User Queries

Tune answers to every use case

66%+ email open rates


7. Slot Straight Into Your Workflow

⚡ Push Alerts While Actionable

Configure topics once—get pinged within seconds

OpenAI o3 • AWS SES • SMS & In-App (next)
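"Configure topics once" suggests a subscription model in which each new insight is matched against stored topic preferences before anything is sent. A minimal sketch, assuming a hypothetical `Subscription` record and simple case-insensitive topic overlap. Delivery itself (email, SMS, in-app) is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Subscription:
    """Hypothetical record of the topics a user configured."""
    user: str
    topics: frozenset


def recipients_for(insight_topics: list, subscriptions: list) -> list:
    """Return users whose configured topics overlap the insight's topics."""
    wanted = {t.lower() for t in insight_topics}
    return sorted(
        {s.user for s in subscriptions if wanted & {t.lower() for t in s.topics}}
    )


subscriptions = [
    Subscription("alice", frozenset({"Private Credit"})),
    Subscription("bob", frozenset({"Real Estate"})),
]
to_notify = recipients_for(["private credit", "mandate change"], subscriptions)
print(to_notify)
```

Matching at ingest time, not on a schedule, is what would let an alert go out within seconds of capture.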

🔌 Enterprise Integrations (future)

Send insights with a click

Salesforce • DealCloud • Backstop

🚀 Enterprise API Suite

Production-ready REST API (Live at Marshall Wace)

Platform Revenue • OEM Partnerships
🎯

Real-time intelligence

for capital raisers who can't afford to be late


Why Incumbents Can't Catch Up

Legacy players index what's already published. Osmosis captures what's about to disappear.

🌐 The Unindexed Web

Zoom Rooms • Livestreams • Meeting Portals • Documents that vanish

🧠 Dynamic KG

Who matters, where they meet

👻 Ghost-Attend

Agentic crawlers capture

🎙️ Audio NLP

Built for spoken chaos

🔒 Compound Moat

Historical Archive (can't backfill) • Feedback Flywheel (gets smarter) • Delta History (unique insights)

🚀 Enterprise API

Live at Marshall Wace

⚡ Real-time Alerts

66%+ open rates

🔌 Integrations

Salesforce, DealCloud

For incumbents, replicating Osmosis isn't building a feature. It's rebuilding their entire stack.