Synthesis ⚗️

Synthesis turns video, audio, and text into a structured, searchable knowledge base — running entirely on your own machine, with no data sent to external services.

Paste a YouTube URL (a lecture, a conference talk, a clinical interview) and Synthesis extracts the key insights, links them to related ideas you've already captured, and lets you search across everything - by keyword or meaning.

Who is this for?

Clinicians and clinical teams navigating complex, information-dense cases — without sending patient-relevant material to third-party AI services
Developers and researchers who want a local-first, extensible knowledge distillation pipeline they can build on

What problem does it solve?

Complex cases - unexplained diagnoses, treatment-resistant conditions, multi-system presentations - generate enormous amounts of information spread across consultations, literature, and guidelines. Synthesis helps you build a persistent, connected knowledge base from that material, so you're not re-deriving the same synthesis every time you return to a case.

Unlike general AI assistants (ChatGPT, Copilot, Claude), Synthesis runs on your own infrastructure. Nothing leaves your machine.

How it works

Ingest - paste a YouTube URL, audio file, or text source (coming soon)
Transcribe - transcript is extracted automatically
Distil - a local AI model extracts atomic insights and a summary
Store - insights are saved with semantic embeddings in a local database
Search - query by keyword or meaning across everything you've captured

Privacy & data sovereignty

Synthesis is local-first by design, and supports approved enterprise AI providers:

Runs locally - all processing via Ollama on your own machine; no API keys, no data egress, no cloud dependency
Azure OpenAI - planned support (not yet implemented; see roadmap below)
Suitable for information-governance-constrained environments (NHS trusts, public sector organisations)

⚠️ Synthesis does not process patient-identifiable data and is not a medical device. It is a knowledge management tool for educational and research use.

Get started

For clinicians and non-technical users (coming soon)

A one-click installer requiring no coding is on the roadmap. In the meantime, Synthesis can be set up with a small amount of terminal use — the steps below take around 10–15 minutes and only need to be done once.

Follow the developer setup below, then use:

./synthesis https://www.youtube.com/watch?v=<id>
./synthesis --search "your query here"

We'd love feedback from clinical users on what would make this easier. Open an issue or get in touch.

For developers

1. Install Elixir

The easiest cross-platform way is Mise:

# macOS / Linux
curl https://mise.run | sh
mise use --global elixir@latest erlang@latest

# Windows
winget install jdx.mise
mise use --global elixir@latest erlang@latest

2. Install yt-dlp and Ollama

pip install yt-dlp

Download Ollama from ollama.com/download, then:

ollama pull qwen3.6:27b
ollama pull qwen3:8b           # optional lighter alternative
ollama pull qwen3-embedding:8b
ollama serve

### 3. Clone and run

```bash
git clone https://github.com/The-Strategy-Unit/synthesis
cd synthesis
mix deps.get
mix wiki.add https://www.youtube.com/watch?v=<id>
mix wiki.search "your query"

Configuration

Configuration options

Edit config/config.exs or set environment variables via .env.

Setting	Default	Notes
`ollama_url`	`http://localhost:11434`	Change if Ollama runs on a different host
`ollama_model`	`qwen3.6:27b`	Lighter alternative: `qwen3:8b`
`ollama_model_embed`	`qwen3-embedding:8b`
`chunk_concurrency`	`2`	Parallel chunks for long transcripts
`single_chunk_threshold`	`2500` tokens	Below this, processed in one call
`max_retries`	`3`	Retry attempts on LLM failure
`output_dir`	`output/`	Where markdown notes are written
`db_path`	`synthesis.db`	SQLite database location

Project structure

Codebase layout

lib/synthesis/
  synthesis.ex     # Top-level orchestrator
  cli.ex           # CLI entrypoint
  fetcher.ex       # yt-dlp wrapper
  extractor.ex     # LLM client - insight extraction
  chunker.ex       # Long transcript splitter
  embedder.ex      # Embeddings client
  store.ex         # DB reads/writes
  writer.ex        # Markdown/Obsidian output
  queue.ex         # Pipeline GenServer
  repo.ex          # SQLite GenServer
  migrations.ex    # Schema runner
  application.ex   # OTP supervisor

mix/tasks/
  wiki.add.ex      # mix wiki.add <url>
  wiki.search.ex   # mix wiki.search <query>

priv/migrations/   # SQL schema files
output/            # Generated notes (gitignored)

Roadmap

Synthesis is actively developed. Current priorities:

Web UI - browser-based interface for non-technical users
Bundled binary - single download, no prerequisites
In-app knowledge graph - replace Obsidian dependency with a built-in navigator
Multi-user support - submission queue and role-based access
Azure deployment option - for organisations that need hosted infrastructure

Contributing

Contributions are welcome - particularly from clinicians who can help shape how Synthesis handles complex medical knowledge.

Licence

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
config		config
lib		lib
priv/migrations		priv/migrations
test		test
.env.example		.env.example
.formatter.exs		.formatter.exs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Synthesis ⚗️

Who is this for?

What problem does it solve?

How it works

Privacy & data sovereignty

Get started

1. Install Elixir

2. Install yt-dlp and Ollama

Configuration

Project structure

Roadmap

Contributing

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Synthesis ⚗️

Who is this for?

What problem does it solve?

How it works

Privacy & data sovereignty

Get started

1. Install Elixir

2. Install yt-dlp and Ollama

Configuration

Project structure

Roadmap

Contributing

Licence

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages