{"id":"2009347948816335031","url":"https://x.com/aliansarinik/status/2009347948816335031","text":"","author":{"name":"Ali Ansari","username":"aliansarinik","avatarUrl":"https://pbs.twimg.com/profile_images/2017151410253737984/r2KW2Beg_200x200.jpg"},"createdAt":"Thu Jan 08 19:34:17 +0000 2026","engagement":{"replies":243,"retweets":640,"likes":4884,"views":2775056},"article":{"title":"human data will be a $1 trillion/year market","previewText":"human data will be a $1 trillion/year market\nThis is not a short-term prediction. It is a structural claim about where the economy converges.\nTo believe this, you need to accept two assumptions:","coverImageUrl":null,"content":"human data will be a $1 trillion/year market\n\nThis is not a short-term prediction. It is a structural claim about where the economy converges.\n\nTo believe this, you need to accept two assumptions:\n\n- Digital and physical intelligence can eventually automate the tedious parts of the economy\n\n- Self-learning intelligence without human data is impossible at the frontier\n\nautomation is the most useful & liberating thing humanity can do\n\nIf AI systems can automate functions, then automating all functions is the highest-leverage task for humanity.\n\nAutomation compresses time. It allows:\n\n- Aspirations to be fulfilled faster, by orders of magnitude\n\n- Humans to focus on the enjoyable, judgment-heavy parts of work while robots and agents handle the rest\n\nAs humans gain time, they create more. Net-new work is initially creative and high-value. Over time it becomes legible, repeatable, and ready for automation. Once automated, it continues delivering value while freeing humans to focus on new creative work. This loop is permanent.\n\nAutomation does not eliminate human work. It pushes humans toward higher-value, more creative work.\n\nAt a societal level, automation reshapes the economics of the world. 
As AI systems take on more production and coordination, the cost of producing goods and services collapses while availability explodes.\n\nAt the same time, distribution becomes increasingly optimal. Digitally and physically intelligent systems coordinate supply and demand with less friction, less waste, and less delay, making access faster, cheaper, and more reliable every year.\n\nAI models learn from humans forever\n\nEvery artificially intelligent system learns from humans in some form:\n\n- Demonstrations\n\n- Supervised fine-tuning\n\n- Preference learning\n\n- Complex rubrics and evaluations\n\n- Continual corrections\n\nEven self-play and synthetic data depend on human grounding: humans define objectives, rewards, and what “good” looks like.\n\nAs a result:\n\n- Every function in the economy contains useful learning signal\n\n- Every decision, exception, failure, and tradeoff creates data\n\nBut raw activity is not enough. That data must be:\n\n- Recorded\n\n- Structured\n\n- Evaluated\n\n- Packaged into usable pipelines\n\nAnd importantly, functions must continue running while they are being automated. Automation is iterative, not instantaneous.\n\nthis creates a universal obligation and opportunity\n\nTo iteratively automate functions, every company, government agency, or institution running real operations must consume and produce structured data related to those functions. In most cases, it will not be optimal for them to create or structure that data themselves, due to scale inefficiencies, high fixed costs, and the operational difficulty of producing high-quality, reusable structured data in-house.\n\nWe already see this dynamic today. For example, many lawyers produce more leverage per hour working on standardized, structured legal data through platforms like micro1 than they do performing unstructured work inside individual law firms. 
At micro1, over 1,000 lawyers work in structured data creation and earn on average ~20% more than in traditional firm roles. Law firms themselves are unlikely to become large-scale producers of structured training data, but they will increasingly be consumers of that data, either directly or by having it embedded in the tools they use.\n\nThis creates a powerful incentive structure.\n\nLabs that are automating functions will pay for this data, because over the long term the value gained from incremental automation far exceeds the cost of acquiring the data.\n\nAs a result:\n\n- Entities are incentivized to produce high-quality human data not just to automate themselves, but because that data has external market value\n\n- Every hour of work can simultaneously:\n\n- Run the organization\n\n- Train AI models\n\n- Generate additional revenue for the organization\n\nHuman labor becomes not just labor to produce goods & services, but a revenue-generating asset on its own.\n\nthe ultimate convergence: 5%+ of human time is spent on human data\n\nIt’s reasonable to think that most functions in the economy will spend some amount of time trying to automate themselves. Not fully, and not all at once, but continuously pushing work out of the human loop as it becomes repeatable and scalable.\n\nToday, even knowledge workers spend the majority of their time on communication and coordination rather than on what we would consider actual productive work. As automation advances, tedious parts of knowledge work are progressively removed, and automation increasingly absorbs coordination, scheduling, routing, and routine communication. The result is a larger share of human time being spent on judgment-heavy knowledge work.\n\nEven under conservative assumptions, it is reasonable to expect that in a more automated economy roughly 75% of work time is still spent on communication and coordination, while about 25% is spent doing actual work.\n\nNot all of that work needs to be structured. 
But a meaningful fraction does. Work that produces decisions, judgments, demonstrations, evaluations, and exceptions becomes far more valuable when captured in a structured, reusable form, both to complete the task and to enable future automation. If only one fifth of that actual work is performed in structured environments, that implies roughly 5% of total human labor time is spent generating structured human data.\n\nWith global GDP at roughly $100T, and labor representing about 50% of that, total labor spend is around $50T annually. Five percent of that corresponds to roughly $2.5T per year of human time directed at enabling automation, creating demonstrations, feedback, evaluations, and learning signals for AI systems.\n\nCertainly not all of this will become explicit spend in the human data market. Much of it will remain implicit, fragmented, or unpriced. But even with aggressive discounting, you still arrive at something on the order of $1T per year.\n\nautomation reshapes labor, it doesn’t shrink it\n\nThis results in automation scaling. As automation scales, some of what was spent on human labor is redirected toward:\n\n- Energy\n\n- Compute\n\n- AI labor\n\nHowever, total human labor spend continues to increase.\n\nWhy?\n\nAutomation creates time.\nTime enables creativity.\nCreativity produces net-new functions within the economy.\n\nThose functions are initially done by humans. Over time, they follow the same automation cycle.\n\nhuman labor gets more expensive because:\n\n- Human time is finite at any moment\n\n- Creativity and judgment are scarce\n\n- Net-new ideas command premium value\n\nAs automation expands, humans concentrate more of their time on higher-leverage work. While total human hours do grow over time, that growth cannot be rapidly accelerated in response to demand. 
The fastest and dominant way the labor market expands is by increasing the value created per human hour.\n\nAs this continues:\n\n- Total human labor spend rises\n\n- A larger share of human time is spent generating learning signals and enabling automation\n\nwe should never call it annotation again\n\nThe importance of this work in shaping AI means calling it “data labeling” or “annotation” is completely inaccurate. These phrases describe mechanical tasks, when the real value comes from human judgment, expertise, and decision-making expressed in structured form.\n\nA more accurate description is expert human data creation or structured human judgment.\n\nThis is how human expertise compounds in an automated economy. It explains why human data scales with automation rather than disappearing, and why it becomes a first-class economic input over time.\n\nhuman brilliance is needed more than ever\n\nThis does not require extreme assumptions. It only requires that automation continues to work, and that intelligence continues to learn from humans. If that is true, then human data is not a phase or a temporary bottleneck. It is a structural input to the economy.\n\nHuman judgment is captured, structured, and refined.\n\nThat judgment becomes the training substrate of intelligence.\n\nThat intelligence, in turn, produces more automation.\n\nAs functions are automated, human time is freed. That time is spent creating new functions to automate, and the beautiful cycle continues."},"adhxContext":{"savedByCount":1,"publicTags":[],"previewUrl":"https://adhx.com/aliansarinik/status/2009347948816335031"}}