The plain-language summary
A short, human paragraph that explains what the record is — readable in about five seconds, no jargon, no skimming a PDF.
The connector is the easy part. The hard part is everything that happens next — to every email, PDF, photo, statement, telemetry stream, and voice note you let in. Seven stages, each one inspectable, each one reproducible. The pipeline is what turns a Tesla session, a 14-line HELOC statement, or a parking-lot voice note into a typed record the Provenance Ledger will sign and keep forever.
Most of your life already lives in files — but a PDF in a folder doesn't know what it is, who it's about, or when it matters. The pipeline is the part of Lossless that turns a pile of documents into something you can actually ask questions of.
Under the hood there are 51 individual steps across 7 stages. You don't need to think about any of them — but you should be able to see them. Tap a stage to look inside.
Nothing sits in a folder waiting to be noticed. The instant a file comes in — from your inbox, your drive, an upload, a voice note — Lossless gives it a stable address and opens a record for it. From here on, it can't get lost.
A scanned receipt isn't text until someone reads it. Lossless reads everything: the words on a page, the words inside a photo, what's spoken in an audio note, what happens in a video. If it's in another language, it's translated. It even reads the quiet details — when a photo was taken, where, and on what device — and picks out the people, businesses, amounts, orders, and dates along the way.
Now the file is understood, not just transcribed — what it's about, who it concerns, why it matters, and what (if anything) you need to do about it. Lossless quietly connects the new file to the ones it belongs with: the confirmation email, the earlier statement, the right person, the right property. The pile starts becoming a story.
Every record gets a short, plain-language summary and a one-line brief — the kind of thing you'd jot on a sticky note. It's categorized — filed into the taxonomy of domains, document families, and spend categories. And if something's missing or thin, Lossless goes back and fills the gap rather than leaving a hole.
A record is only useful if it shows up when you need it. Lossless tags each one with the topics and the bigger themes it belongs to — drawn from a three-tier map of your life that the system builds for you — so a question three years from now still finds the answer.
This is the part the name is about. The finished record is kept in several places at once, made searchable by meaning rather than just keywords, woven into the graph of people and places and things you own, and sealed with its provenance — payload, origin, seal. The original file is never touched. Nothing gets lost.
Before Lossless calls a record done, one more pass looks it over — checking for anything thin, missing, or off — and fixes it. The bar is simple: would this hold up if you actually had to rely on it?
7 STAGES · 51 STEPS · RUNS IN THE BACKGROUND · YOU NEVER PRESS A BUTTON
Open any record — or any upload batch — and there's an AI Pipeline tab. Every step, timed. Every file in a batch, mapped. Nothing hidden behind a spinner.
The Overview tab shows the AI summary, the one-line brief, and every extracted entity — all six of which are covered further down this page.
Every cell is one file. Hover to inspect. A batch of years of paperwork runs the whole 51-step pipeline, file by file — and tells you exactly which five need a human glance.
The Entities tab lists every person, business, place, amount, date, topic, and action item the pipeline pulled — each one tappable, each one linked back to the source.
The pipeline treats a voice memo and a nine-page bank statement with the same care. Choose one and see.
"Categorized" isn't a vague promise here. There's a real, structured taxonomy underneath — domains, record types, document families, and a deep spend map — and every file is routed into it.
Every record belongs to one. Together they hold 130+ record types and 25+ sub-types.
+ Memory & Knowledge · Projects & Action Items · Topics & Pulse · Photos & Media · AI Chat · Device Sync · System — 19 domains in all
A second axis, running across the domains: the document type, grouped into eight families.
When a record involves money, it's filed down to the leaf. Tap a category to see how deep it goes.
+ Groceries & Food · Housing · Digital Services · Shopping & Retail · Travel · Transit · Education · Childcare · Charity & Gifts · Professional Services · Beauty & Personal Care — 21 in all
A short, human paragraph that explains what the record is — readable in about five seconds, no jargon, no skimming a PDF.
An entity-first headline with the significant stuff flagged — the deadline, the amount, the thing you'd actually want to know first.
Routed into the taxonomy — a domain, a record type, a document family, and a spend category down to the sub-category.
Every person, business, place, amount, and date it touched — linked, so you can walk from a person to a property to an account.
The throughlines of your life, tagged onto the record across a three-tier topic map — so a question years later still surfaces it.
Payload, origin, seal. Every finished record carries proof of where it came from — so any agent you authorize can trust it.
"I connected eighteen years of Gmail and a folder of scanned paperwork I'd been avoiding. A day later it was all just… records — summarized, dated, filed under the right category. I didn't lift a finger. The pipeline did the part I'd been dreading for a decade."
"Reproducibility at the parser level is the part the security auditor actually cares about — re-run the source, re-derive the record, the signature still verifies."
— What the technical evaluator writes in their report
Next pillar: the schema-aware connectors that hand the pipeline its raw material — Gmail receipts, Plaid statements, Tesla telemetry, iMessage threads — each one parsed into a typed record, not a blob.