Every knowledge worker produces a continuous stream of intellectual output — articles, analyses, reports, presentations, design documents, research memos, code — that disperses across platforms and disappears. The journalist's archive lives on whatever CMS the publication used, until the publication is acquired, redesigned, or goes under. The consultant's project deliverables live in a client folder on a shared drive that gets migrated three times and then archived. The researcher's papers live on a university repository that may or may not maintain DOI persistence across institutional restructuring. The teacher's curriculum resources live in a school cloud storage folder tied to an employment account. The common thread is that the work exists as long as the institutional host exists and is willing to maintain the hosting relationship. When the relationship ends — because the person moves, the platform closes, or the institution changes its systems — the work becomes inaccessible, unattributable, and often lost.
This is not a new problem, but it's a problem that has become more acute as knowledge work has migrated increasingly to cloud platforms with opaque data ownership terms. The SaaS business model that works well for workflow tools is poorly aligned with the interests of knowledge workers who produce the content those platforms host. A CMS that owns the distribution relationship with readers has different incentives than the writer who needs their work to remain findable, attributable, and accessible regardless of what happens to the platform. The terms of service of most content platforms are clear that the platform controls the hosting relationship and can modify the access terms or shut down at any time with limited notice. This is legally standard but practically problematic for anyone trying to maintain a persistent, owned record of professional intellectual output.
The tools that address this problem effectively need to do three things: archive the content in a format that survives platform changes; make it findable and navigable through the kind of semantic indexing that reflects how people actually think about their own work; and provide an ownership and attribution layer that is verifiable in contexts where the original platform no longer exists. The first requirement is engineering. The second requires genuine intelligence about content structure and subject matter. The third is an institutional and credentialing problem as much as a technical one — the value of a verified attribution record depends on whether the verification mechanism is trusted by the people evaluating it.
Authory, which we backed in 2024, is solving the first two requirements with particular rigour for writers and journalists: continuous archiving from publication sources, semantic organisation of the archive, and a portfolio surface that the author controls rather than the publisher. The credentialing layer is the harder long-term problem, because it requires the tool to be trusted not just by the knowledge worker but by the people evaluating the knowledge worker's output — employers, academic institutions, professional bodies, licensing authorities. We think the credentialing layer will be built by companies operating in the institutional space first — the portable credential and micro-credential infrastructure that post-secondary institutions are developing — and that knowledge portfolio tools for individuals will integrate with those credentialing layers as they mature rather than building parallel verification infrastructure.
The total addressable market for knowledge portfolio tools is genuinely large and underappreciated. The global knowledge worker population — loosely defined as people whose primary work output is information, analysis, or professional judgment rather than physical goods — is in the hundreds of millions. The tools they currently use to manage their professional intellectual history are a combination of LinkedIn profiles, personal websites, PDF folders, and memory. None of those are adequate for the kind of persistent, verified, semantically rich professional record that the knowledge economy increasingly requires. The companies that solve this problem well won't be selling a productivity tool; they'll be building the professional identity infrastructure that a mobile, platform-switching knowledge worker population needs to maintain their intellectual capital across a working life.