Prediction Tracker
55 predictions across 11 categories. Updated daily via Bayesian likelihood ratio analysis with a 10-persona forecaster panel.
How to read predictions
Bets AGAINST the consensus view. A 30% contrarian prediction means we think there's a 30% chance the crowd is wrong.
Aligns with the mainstream view. Higher confidence = stronger agreement with conventional wisdom.
Resolves within 1-2 years. These build our track record — you'll know soon if we were right or wrong.
34% By 2030, AI coding tool revenue growth will have plateaued below 15% YoY, resembling the RPA hype cycle
CONTRARIAN AI Coding 2030 CONTESTED
By 2030, AI coding tool revenue growth will have plateaued below 15% YoY, resembling the RPA hype cycle
Reasoning
Contrarian: AI coding tools may hit a ceiling on what they can reliably automate. Like RPA in 2018-2022, initial excitement gives way to disillusionment as maintenance costs and hallucination-driven bugs accumulate. Consensus says this market grows forever.
Resolution criteria
Resolves YES if the combined revenue growth rate of the top 5 AI coding tool providers (GitHub Copilot, Cursor, Codeium, Tabnine, Amazon CodeWhisperer or successors) falls below 15% YoY as reported in earnings calls or credible market research by end of 2030.
19% Junior developer hiring at FAANG companies will be HIGHER in 2029 than in 2024, not lower
CONTRARIAN Labor 2029
Junior developer hiring at FAANG companies will be HIGHER in 2029 than in 2024, not lower
Reasoning
Contrarian: AI makes junior devs far more productive, so companies hire MORE of them at lower cost rather than fewer. The 'AI replaces juniors' narrative ignores Jevons paradox — cheaper labor input means more demand for it, not less. Consensus says junior hiring collapses.
Latest evidence (LR: 0.50)
Oracle's massive 30k job cuts signal a structural hollowing out of traditional engineering roles
Resolution criteria
Resolves YES if aggregate entry-level software engineering headcount at Meta, Apple, Amazon, Netflix, Google (or successors) in 2029 exceeds their 2024 levels, per LinkedIn Workforce Reports, earnings disclosures, or credible third-party analysis.
47% The median salary premium for 'AI engineering' skills will fall below $10K by 2029 as AI fluency becomes table stakes
CONTRARIAN Compensation 2029
The median salary premium for 'AI engineering' skills will fall below $10K by 2029 as AI fluency becomes table stakes
Reasoning
Contrarian: When everyone can use AI tools, AI fluency stops being a differentiator and the premium collapses — just like 'internet skills' in 2005. Consensus says AI-fluent engineers command massive premiums indefinitely.
Latest evidence (LR: 1.20)
AI fluency is becoming table stakes as 'learning by doing' with AI tools is required to maintain relevance.
Resolution criteria
Resolves YES if Levels.fyi, Glassdoor, or a major compensation survey (n>5000) shows <$10K median total comp difference between 'AI engineer' and 'software engineer' roles at comparable experience levels by end of 2029.
17% -3 Open-source AI models will LOSE market share in enterprise coding tools to proprietary models between 2026 and 2030
CONTRARIAN AI Coding 2030
Open-source AI models will LOSE market share in enterprise coding tools to proprietary models between 2026 and 2030
Reasoning
Contrarian: Enterprises care about liability, support, and integration — not model weights. As coding agents handle more critical workflows, enterprises will pay the proprietary premium for contractual guarantees. Open-source wins the hobbyist market but loses the enterprise. Consensus says open-source inevitably wins.
Latest evidence (LR: 0.50)
Fractional node splitting tools make open-weights models like DeepSeek V3 highly accessible, helping developers avoid proprietary lock-in.
Resolution criteria
Resolves YES if enterprise AI coding tool market share (by revenue) for proprietary-model-based tools exceeds their 2026 share, per Gartner, IDC, or equivalent analyst report published by 2031.
31% AI-generated code will have a LOWER average CVE density than human-written code by 2029
CONTRARIAN Security 2029 CONTESTED
AI-generated code will have a LOWER average CVE density than human-written code by 2029
Reasoning
Contrarian: AI models trained on vulnerability databases will internalize secure patterns faster than humans learn them. The 'AI code is insecure' narrative is based on 2023-2025 benchmarks; models improve while human error rates stay constant. Consensus says AI makes code less secure.
Latest evidence (LR: 0.80)
Warnings about existential security risks from integrating AI-generated extensions suggest AI code may harbor more vulnerabilities, not fewer
Resolution criteria
Resolves YES if a peer-reviewed study or major security vendor report (Snyk, Veracode, Synopsys) finds AI-generated code has fewer CVEs per KLOC than human-written code in equivalent production codebases by end of 2029.
47% Total global software developer headcount will be HIGHER in 2030 than in 2025
CONTRARIAN Labor 2030
Total global software developer headcount will be HIGHER in 2030 than in 2025
Reasoning
Contrarian: Every wave of developer productivity tools (compilers, IDEs, cloud, open-source) has increased total developer employment. AI is a productivity multiplier, not a replacement. More software gets built, creating more jobs. Consensus says mass developer unemployment.
Latest evidence (LR: 0.60)
Oracle's mass domestic layoffs paired with aggressive H-1B filings suggest overall engineering headcount growth is being actively suppressed by cost-cutting and automation.
Resolution criteria
Resolves YES if Bureau of Labor Statistics (US), Eurostat, or a credible global estimate (Evans Data, Statista) shows total software developer employment in 2030 exceeding 2025 levels.
64% By 2028, at least 3 major companies will have publicly rolled back autonomous AI coding agent deployments due to quality/cost issues
CONTRARIAN Automation 2028
By 2028, at least 3 major companies will have publicly rolled back autonomous AI coding agent deployments due to quality/cost issues
Reasoning
Contrarian: Autonomous agents sound great in demos but create unmaintainable code at scale. Technical debt from AI-generated code will force high-profile rollbacks, similar to microservices backlash. Consensus says autonomous agents only expand from here.
Latest evidence (LR: 1.50)
Developer fatigue with AI marketing and reality checks on fully autonomous coding agents increase the probability of rollbacks
Resolution criteria
Resolves YES if 3+ companies with >1000 engineers publicly announce scaling back, pausing, or discontinuing autonomous AI coding agent programs, documented in press releases, blog posts, or credible journalism by end of 2028.
61% +3 China will produce the leading AI coding tool by market share outside the US by 2029
CONTRARIAN Geopolitics 2029
China will produce the leading AI coding tool by market share outside the US by 2029
Reasoning
Contrarian: DeepSeek, Alibaba, and ByteDance are investing massively in code models. China has more developers than the US. Export controls push China to build independent tooling that captures the non-US market. Consensus focuses exclusively on US-based tools.
Latest evidence (LR: 1.50)
The democratization of DeepSeek V3 access accelerates Western adoption of Chinese frontier models.
Resolution criteria
Resolves YES if a Chinese-origin AI coding tool (by a China-headquartered company) holds the largest market share by MAU or revenue in any 3+ non-US countries with >100K developers, per credible analyst report by end of 2029.
69% Natural language will NOT replace code as the primary software specification medium by 2032
CONTRARIAN Workflow 2032
Natural language will NOT replace code as the primary software specification medium by 2032
Reasoning
Contrarian to the contrarians: despite AI hype, natural language is fundamentally ambiguous. Formal languages exist because precision matters. Engineers will still write code — just different code (prompts-as-code, constraint specs, test harnesses). The 'end of programming' narrative is wrong. Consensus says natural language replaces code.
Latest evidence (LR: 1.30)
Interacting with AI is becoming a technical systems engineering task rather than simple natural-language prompting
Resolution criteria
Resolves YES if >50% of production software artifacts at Fortune 500 companies are still primarily specified in formal programming languages (not natural language prompts) as of 2032, per developer surveys or enterprise tooling reports.
41% The 'vibe coding' movement will produce a mass wave of catastrophic production failures by 2028, triggering regulatory intervention
CONTRARIAN Regulation 2028
The 'vibe coding' movement will produce a mass wave of catastrophic production failures by 2028, triggering regulatory intervention
Reasoning
Contrarian: Non-engineers using AI to build production systems they do not understand will cause failures in healthcare, finance, or infrastructure. This triggers a backlash — professional licensing for software, mandatory code audits, or AI-code liability laws. Consensus says vibe coding democratizes development harmlessly.
Latest evidence (LR: 1.20)
Rising concerns over existential security risks of AI-generated code in legacy architectures slightly increase the risk of catastrophic failures
Resolution criteria
Resolves YES if any G7 country passes legislation or regulatory rule specifically addressing AI-generated code quality, liability, or mandatory review requirements by end of 2028.
67% Cursor, the company, will not exist as an independent entity by 2028 (acquired or defunct)
CONTRARIAN Market 2028
Cursor, the company, will not exist as an independent entity by 2028 (acquired or defunct)
Reasoning
Contrarian: IDE-layer AI tools are a thin wrapper over foundation models. When model providers ship native IDE features (which they already are), standalone AI IDE companies get squeezed. Cursor's $50B valuation is the peak, not the floor. Consensus says Cursor becomes the next big platform.
Latest evidence (LR: 1.50)
Massive compute costs and OpenAI's acquisition of TBPN indicate independent AI wrappers face severe financial viability threats.
Resolution criteria
Resolves YES if Anysphere (Cursor's parent) is acquired by another company, merges, or ceases operations by end of 2028. Resolves NO if it remains independent and operational.
68% +5 By 2030, >30% of professional developers will primarily use AI tools from a NON-US company (DeepSeek, Mistral, etc.)
CONTRARIAN Geopolitics 2030
By 2030, >30% of professional developers will primarily use AI tools from a NON-US company (DeepSeek, Mistral, etc.)
Reasoning
Contrarian: US dominance in AI tooling is not guaranteed. DeepSeek R1 already matches frontier US models at a fraction of the cost. European and Asian developers may prefer local alternatives for sovereignty, cost, and latency reasons. Consensus assumes US tools win globally.
Latest evidence (LR: 2.50)
New node-splitting models remove hardware barriers for Chinese models like DeepSeek V3, explicitly accelerating their adoption among developers.
Resolution criteria
Resolves YES if a global developer survey (n>10000) or credible market analysis shows >30% of professional developers primarily using AI coding tools built by non-US-headquartered companies by end of 2030.
32% The senior engineer shortage narrative will prove wrong: AI will make senior-level judgment MORE accessible, not scarcer, by 2030
CONTRARIAN Labor 2030
The senior engineer shortage narrative will prove wrong: AI will make senior-level judgment MORE accessible, not scarcer, by 2030
Reasoning
Contrarian: AI doesn't just replace junior work — it codifies senior patterns. Architecture review, system design, and debugging heuristics get embedded in AI tools, democratizing expertise. The shortage never materializes because AI fills the gap. Consensus says senior engineers become the bottleneck.
Latest evidence (LR: 0.70)
Capital inflows to aerospace draining senior engineering talent implies the senior shortage will persist rather than be solved by AI
Resolution criteria
Resolves YES if 'senior software engineer' roles do NOT appear in the top 10 hardest-to-fill tech positions in Hays, Robert Half, or equivalent staffing firm reports for 2029 or 2030.
59% At least one major AI lab (OpenAI, Anthropic, Google DeepMind) will suffer a sustained >6 month plateau in coding benchmark performance before 2028
CONTRARIAN AI Capability 2028
At least one major AI lab (OpenAI, Anthropic, Google DeepMind) will suffer a sustained >6 month plateau in coding benchmark performance before 2028
Reasoning
Contrarian: Scaling laws have limits. Training data for code is finite. The next jump in coding ability may require architectural breakthroughs that don't arrive on schedule. A plateau would deflate the entire 'AI replaces programmers' narrative. Consensus says capabilities only go up.
Latest evidence (LR: 1.40)
Theoretical shifts away from hardware scaling and strict usage limits on Claude Opus suggest brute-force scaling laws may be hitting a computational wall.
Resolution criteria
Resolves YES if any top-3 AI lab's flagship model shows <2% improvement on SWE-bench, HumanEval, or equivalent coding benchmarks across two consecutive major model releases spanning >6 months, before end of 2028.
50% The biggest winner from AI coding tools by 2030 will be non-tech companies (banks, manufacturers, retailers), not tech companies
CONTRARIAN Market 2030
The biggest winner from AI coding tools by 2030 will be non-tech companies (banks, manufacturers, retailers), not tech companies
Reasoning
Contrarian: Tech companies already have great developers. AI coding tools give the biggest marginal improvement to companies with weak engineering orgs — banks building internal tools, manufacturers automating supply chains, retailers customizing e-commerce. The productivity gain accrues to the laggards, not the leaders. Consensus focuses on how AI helps Silicon Valley.
Resolution criteria
Resolves YES if McKinsey, BCG, or equivalent consultancy publishes analysis showing non-tech-sector companies captured more economic value (cost savings + revenue) from AI coding tools than tech-sector companies by 2030.
45% By end of 2027, GitHub Copilot will lose its #1 market share position in AI coding assistants to a competitor
NEAR-TERM AI Coding 2027
By end of 2027, GitHub Copilot will lose its #1 market share position in AI coding assistants to a competitor
Reasoning
Copilot has first-mover advantage and deep GitHub integration, but Cursor, Claude Code, and others are rapidly gaining ground. The market is fragmenting — yet GitHub's enterprise lock-in and Microsoft's distribution remain formidable.
Latest evidence (LR: 1.30)
Eroded trust in Azure and the rapid rise of local/open-source alternatives threaten Copilot's market dominance.
Resolution criteria
Resolves YES if any single competitor surpasses GitHub Copilot in monthly active paid users or revenue by end of 2027, per GitHub earnings, competitor disclosures, or credible analyst estimates (Gartner, IDC).
51% By 2029, >60% of new production code at Fortune 500 companies will be initially generated by AI and then reviewed by humans
CONSENSUS AI Coding 2029
By 2029, >60% of new production code at Fortune 500 companies will be initially generated by AI and then reviewed by humans
Reasoning
The AI-first workflow is already standard at many tech companies. As AI coding tools improve and enterprises adopt them, the generate-then-review pattern will become dominant for routine code. Enterprise inertia slows adoption but doesn't stop it.
Latest evidence (LR: 2.00)
New tools like Agents Observe show engineers shifting toward dashboards to filter and audit AI-generated multi-agent code
Resolution criteria
Resolves YES if a major developer survey (Stack Overflow, JetBrains, or GitHub) reports >60% of Fortune 500 enterprise respondents say AI generates the initial draft for the majority of their new code, by end of 2029.
75% At least one AI coding tool will achieve >90% on SWE-bench Verified by end of 2026
NEAR-TERM AI Coding 2026
At least one AI coding tool will achieve >90% on SWE-bench Verified by end of 2026
Reasoning
Progress on SWE-bench has been rapid — from ~2% in early 2024 to over 70% by early 2026. With agentic scaffolding improvements and better models, the 90% threshold is within reach in the near term.
Latest evidence (LR: 1.50)
Meta's open-sourcing of repoprover advances the automated logical verification necessary for AI models to conquer the final complex edge cases in SWE-bench.
Resolution criteria
Resolves YES if any AI coding system (model + scaffolding) scores >90% on SWE-bench Verified leaderboard with a publicly documented submission by December 31, 2026.
87% The AI coding tool market will see a major consolidation wave: 5+ funded startups (>$10M raised) will shut down or be acqui-hired by end of 2027
CONSENSUS AI Coding 2027
The AI coding tool market will see a major consolidation wave: 5+ funded startups (>$10M raised) will shut down or be acqui-hired by end of 2027
Reasoning
The market is overcrowded with AI coding startups competing on thin differentiation. As foundation model providers bundle coding features, standalone tools face margin compression. This pattern has played out in every enterprise SaaS wave.
Latest evidence (LR: 2.50)
OpenAI's TBPN acquisition and unsustainable compute economics signal an active consolidation and death of thin wrappers.
Resolution criteria
Resolves YES if 5+ AI coding tool startups that raised >$10M in venture funding announce shutdown, acqui-hire, or fire sale (acquisition at <1x total funding) by end of 2027, per Crunchbase or PitchBook data.
90% By 2030, the role title 'prompt engineer' will appear in fewer than 1% of software job postings, having been absorbed into standard engineering roles
CONSENSUS Labor 2030
By 2030, the role title 'prompt engineer' will appear in fewer than 1% of software job postings, having been absorbed into standard engineering roles
Reasoning
Prompt engineering is becoming a universal skill rather than a specialization. As AI tools improve at interpreting intent, explicit prompt crafting becomes less important. The standalone role is already declining in job boards.
Latest evidence (LR: 2.00)
Debugging AI process lifecycles proves AI interaction is a systems engineering task, absorbing the prompt engineering role
Resolution criteria
Resolves YES if 'prompt engineer' (as a distinct job title, not mentioned in requirements) constitutes <1% of software engineering job postings on LinkedIn, Indeed, or Glassdoor by end of 2030.
35% India will surpass the US in absolute number of developers actively using AI coding tools by end of 2028
CONSENSUS Labor 2028
India will surpass the US in absolute number of developers actively using AI coding tools by end of 2028
Reasoning
India already has more GitHub users than the US. AI coding tools are especially valuable for developers who learned English as a second language. Free tiers and lower-cost alternatives will drive massive adoption in India's enormous developer population.
Resolution criteria
Resolves YES if a credible survey or platform data (GitHub, JetBrains, Stack Overflow) shows India has more developers regularly using AI coding tools than the US by end of 2028.
10% The 'AI software engineer' job title will be among the top 5 fastest-growing tech roles in the US by 2028
CONSENSUS Labor 2028
The 'AI software engineer' job title will be among the top 5 fastest-growing tech roles in the US by 2028
Reasoning
Companies need engineers who can build, evaluate, and maintain AI systems — not just use them. This hybrid role combining ML engineering with software engineering is already proliferating at major tech companies.
Latest evidence (LR: 0.60)
AI unbundling into gig tasks and AI fluency becoming table stakes contradicts 'AI software engineer' remaining a breakout specialized title
Resolution criteria
Resolves YES if 'AI software engineer' or equivalent title appears in the top 5 fastest-growing tech job categories per LinkedIn's annual Jobs on the Rise report, BLS data, or Glassdoor's analysis for any year through 2028.
78% By 2030, the bottom quartile of software developer salaries (adjusted for inflation) will be LOWER than 2025 levels in the US
CONSENSUS Compensation 2030
By 2030, the bottom quartile of software developer salaries (adjusted for inflation) will be LOWER than 2025 levels in the US
Reasoning
AI tools commoditize routine coding work. While top engineers see pay increases, the floor drops as more people can do basic development. This mirrors what happened to web design salaries after template builders emerged.
Latest evidence (LR: 1.60)
Oracle replacing domestic workers with cheaper visa labor amid mass layoffs validates strong downward pressure on bottom-quartile US engineering salaries.
Resolution criteria
Resolves YES if BLS Occupational Employment Statistics or a major comp survey shows the 25th percentile of US software developer total compensation (inflation-adjusted to 2025 dollars) is lower in 2030 than in 2025.
68% Staff+ engineer compensation at top tech companies will increase by >20% in real terms between 2025 and 2030
CONSENSUS Compensation 2030
Staff+ engineer compensation at top tech companies will increase by >20% in real terms between 2025 and 2030
Reasoning
As AI handles more routine work, the premium on system design, cross-team coordination, and architectural judgment increases. Companies will pay more for the engineers who can direct AI-augmented teams and make high-stakes technical decisions.
Latest evidence (LR: 1.40)
Massive capital influx into high-performance systems engineering will increase demand and compensation for staff+ talent
Resolution criteria
Resolves YES if Levels.fyi data shows median total compensation for Staff/Principal engineers at FAANG-tier companies in 2030 is >20% higher in real terms (CPI-adjusted) than 2025 levels.
58% By 2029, freelance developer hourly rates on Upwork/Toptal for routine web development will drop below $25/hr globally (from ~$50+ in 2025)
CONTRARIAN Compensation 2029
By 2029, freelance developer hourly rates on Upwork/Toptal for routine web development will drop below $25/hr globally (from ~$50+ in 2025)
Reasoning
AI tools dramatically lower the skill floor for routine web development, expanding the supply of people who can deliver basic projects. Platform data already shows rate compression in commodity coding categories. But complex work may sustain premium rates.
Latest evidence (LR: 1.40)
Commoditization of standard React/CRUD web development supports a drop in routine freelance rates
Resolution criteria
Resolves YES if the median hourly rate for 'web development' category on Upwork or Toptal falls below $25/hr for the majority of quarters in 2029, per platform data or freelancer surveys.
77% A major AI-generated code vulnerability will cause a breach affecting >10 million users by end of 2027
CONSENSUS Security 2027 CONTESTED
A major AI-generated code vulnerability will cause a breach affecting >10 million users by end of 2027
Reasoning
As AI generates more production code with less human review, the attack surface expands. AI tends to reproduce common vulnerability patterns from training data. The sheer volume of AI-generated code makes a large-scale breach statistically likely.
Latest evidence (LR: 1.60)
The highly targeted Axios supply chain attack demonstrates how malicious actors can use social engineering to trick highly-leveraged AI agents into importing compromised dependencies.
Resolution criteria
Resolves YES if a data breach affecting >10M users is attributed primarily to a vulnerability in AI-generated code, per post-mortem analysis, CVE reports, or investigative journalism by end of 2027.
88% By 2028, >50% of enterprise SAST/DAST tools will have AI-code-specific scanning rules as a standard feature
CONSENSUS Security 2028
By 2028, >50% of enterprise SAST/DAST tools will have AI-code-specific scanning rules as a standard feature
Reasoning
The security industry always follows the market. As AI-generated code proliferates, security vendors are already adding AI-specific detectors. This is a natural evolution, not a bold prediction — it's already happening at vendors like Snyk and Semgrep.
Latest evidence (LR: 1.50)
Shift from application-level vulnerabilities to dev-environment and supply-chain hardening drives enterprise security needs
Resolution criteria
Resolves YES if >50% of the top 10 SAST/DAST tools by market share (per Gartner Magic Quadrant or Forrester Wave) include AI-generated code detection features in their standard product by end of 2028.
59% Prompt injection attacks targeting AI coding agents will become a new OWASP Top 10 category by 2028
NEAR-TERM Security 2028
Prompt injection attacks targeting AI coding agents will become a new OWASP Top 10 category by 2028
Reasoning
As AI agents gain code execution capabilities and access to production environments, prompt injection becomes a critical attack vector. OWASP has already flagged LLM-specific risks, but coding agent exploitation is an emerging subcategory that may earn top-level recognition.
Latest evidence (LR: 1.80)
Cloudflare intercepting ChatGPT output based on React state reveals critical new vulnerability surface between LLMs and app execution
Resolution criteria
Resolves YES if OWASP Top 10 (web or a new AI-specific list that gains equivalent industry adoption) includes a category specifically addressing prompt injection or manipulation of AI coding agents by end of 2028.
58% Fully automated AI test generation will cover >80% of test cases for new features at early-adopter companies by 2028
NEAR-TERM Automation 2028
Fully automated AI test generation will cover >80% of test cases for new features at early-adopter companies by 2028
Reasoning
AI is strong at generating unit tests from code, but integration and end-to-end tests require deep domain understanding. Early adopters will achieve high coverage for unit tests while integration testing remains largely human-driven, keeping overall averages below 80%.
Latest evidence (LR: 2.50)
Kitchen Loop framework operationalizes 'Unbeatable Tests', validating completely automated testing
Resolution criteria
Resolves YES if a peer-reviewed study, conference paper (ICSE, FSE), or major company tech blog documents >80% of test cases being auto-generated by AI for new feature development at a company with >500 engineers, by end of 2028.
13% By 2029, AI-driven continuous deployment pipelines will autonomously ship >40% of production changes at cloud-native companies without human approval
CONTRARIAN Automation 2029 CONTESTED
By 2029, AI-driven continuous deployment pipelines will autonomously ship >40% of production changes at cloud-native companies without human approval
Reasoning
Fully autonomous deployment requires trust in AI-generated tests, rollback mechanisms, and anomaly detection. While the technology exists in pieces, organizational risk aversion and compliance requirements will slow adoption beyond low-risk changes.
Latest evidence (LR: 0.70)
Growing reality check on deployment timelines for fully autonomous agents lowers the probability of autonomous CD pipelines near-term
Resolution criteria
Resolves YES if credible reporting or company disclosures show >40% of production deployments at 3+ companies with >1000 engineers are fully autonomous (no human in the loop) by end of 2029.
68% AI code review tools will catch >50% of bugs that currently escape human code review, as measured in controlled studies, by end of 2027
CONSENSUS Automation 2027
AI code review tools will catch >50% of bugs that currently escape human code review, as measured in controlled studies, by end of 2027
Reasoning
AI code review is improving rapidly and doesn't suffer from reviewer fatigue or social pressure to approve. Multiple startups and incumbents are investing heavily. The controlled study evidence is beginning to accumulate.
Resolution criteria
Resolves YES if a peer-reviewed study or credible industry report shows AI code review catching >50% of bugs that escaped human review in a controlled experiment with >100 PRs by end of 2027.
56% Pair programming between humans will decline by >50% at companies that adopt AI coding assistants, by 2028
CONSENSUS Workflow 2028
Pair programming between humans will decline by >50% at companies that adopt AI coding assistants, by 2028
Reasoning
AI assistants serve as always-available pair programmers. Why schedule synchronous pairing sessions when you can iterate with an AI? But some teams value pair programming for mentorship and knowledge sharing, which AI cannot fully replace.
Resolution criteria
Resolves YES if a developer survey or workplace study (n>500) at AI-adopting companies shows self-reported pair programming frequency dropping >50% compared to pre-AI-adoption levels by end of 2028.
46% By 2028, the average time from idea to deployed MVP will drop below 1 week for solo developers using AI tools, down from 4-8 weeks in 2024
CONSENSUS Workflow 2028
By 2028, the average time from idea to deployed MVP will drop below 1 week for solo developers using AI tools, down from 4-8 weeks in 2024
Reasoning
AI tools already enable rapid prototyping. Combining code generation, design tools, and deployment automation, solo developers can ship MVPs dramatically faster. This is already happening for simple apps; the trend will extend to more complex projects.
Latest evidence (LR: 1.25)
Discontinuation of Mac Pro underscores shift to cloud AI orchestration, supporting faster idea-to-MVP cycles unconstrained by local compute.
Resolution criteria
Resolves YES if a credible developer survey or platform data (e.g., Vercel, Replit, or Heroku analytics) shows median time from project start to production deployment is <7 days for solo AI-assisted projects by end of 2028.
89% +2 By 2029, the majority of developers will spend more time specifying requirements and reviewing outputs than writing code directly
CONSENSUS Workflow 2029
By 2029, the majority of developers will spend more time specifying requirements and reviewing outputs than writing code directly
Reasoning
The developer role is shifting from writing code to directing AI code generation. As AI handles more implementation, the bottleneck moves to clear specification and quality review. This is the natural evolution of the AI-assisted workflow.
Latest evidence (LR: 2.00)
A viral commit of 12,000 AI-generated posts highlights the explosive growth of AI bloat, shifting developer focus toward review and truncation.
Resolution criteria
Resolves YES if a major developer survey (JetBrains, Stack Overflow, or GitHub) reports >50% of respondents spend more time on requirements/review than direct code writing by end of 2029.
43% Computer science degree enrollment in the US will decline >15% from 2025 peak levels by 2029
NEAR-TERM Skills 2029
Computer science degree enrollment in the US will decline >15% from 2025 peak levels by 2029
Reasoning
The narrative that AI replaces programmers may deter students from CS programs. However, CS degrees have survived every previous 'coding is dead' scare, and the field's breadth (ML, systems, security) provides resilience beyond just coding.
Resolution criteria
Resolves YES if NCES (National Center for Education Statistics) or CRA Taulbee Survey data shows US CS degree enrollment in 2029 is >15% below the 2025 level.
95% +2 Systems programming skills (Rust, C++, kernel development) will command a HIGHER salary premium relative to web development by 2030 than in 2025
CONSENSUS Skills 2030
Systems programming skills (Rust, C++, kernel development) will command a HIGHER salary premium relative to web development by 2030 than in 2025
Reasoning
AI is better at generating high-level web code than low-level systems code. Performance-critical, safety-critical, and hardware-adjacent programming requires deep understanding that AI struggles to replicate. Supply of systems programmers stays tight while demand grows.
Latest evidence (LR: 2.50)
Widespread RAM shortages and GPU gamification demonstrate a cultural shift toward developers aggressively upskilling in hardware and systems programming.
Resolution criteria
Resolves YES if Levels.fyi, Stack Overflow survey, or BLS data shows the ratio of systems programming salaries to web development salaries in 2030 exceeds the 2025 ratio by any margin.
32% By 2028, bootcamp graduates using AI tools will perform comparably to CS degree holders in their first year of employment, closing the historical gap
CONTRARIAN Skills 2028
By 2028, bootcamp graduates using AI tools will perform comparably to CS degree holders in their first year of employment, closing the historical gap
Reasoning
Contrarian: AI tools compensate for gaps in theoretical knowledge by providing on-demand expertise. The practical skills bootcamps teach become more valuable when AI handles the CS fundamentals. But companies may still prefer degrees as a signal. Consensus says the degree gap remains important.
Resolution criteria
Resolves YES if a study (n>200) comparing bootcamp vs CS degree hires at comparable companies finds <10% difference in performance reviews, promotion rates, or productivity metrics in the first year, published by end of 2028.
30% The combined valuation of pure-play AI coding tool companies will exceed $200B by end of 2027
CONTRARIAN Market 2027
The combined valuation of pure-play AI coding tool companies will exceed $200B by end of 2027
Reasoning
Contrarian: Current valuations (Cursor at ~$50B) are already stretched. If the AI coding market consolidates, some players fail, and growth slows, the $200B aggregate may be a ceiling rather than a floor. Consensus says this market is just getting started.
Latest evidence (LR: 0.60)
A looming violent market correction and unsustainable unit economics threaten the high valuations of pure-play AI coding companies.
Resolution criteria
Resolves YES if the total post-money valuation (or market cap for public companies) of companies deriving >50% of revenue from AI coding tools exceeds $200B by end of 2027, per PitchBook, Crunchbase, or public market data.
80% Microsoft will bundle AI coding features into Visual Studio / VS Code at no additional charge, putting pricing pressure on the entire market, by end of 2027
CONSENSUS Market 2027
Microsoft will bundle AI coding features into Visual Studio / VS Code at no additional charge, putting pricing pressure on the entire market, by end of 2027
Reasoning
Microsoft's playbook has always been to bundle developer tools into its platform. With Copilot already integrated into VS Code, the natural move is to include basic AI coding features in the standard (free) tier to drive Azure consumption and undercut competitors.
Resolution criteria
Resolves YES if Microsoft offers core AI code completion and chat features in VS Code for free (not requiring a separate Copilot subscription) by end of 2027.
18% By 2029, at least one non-tech Fortune 100 company (bank, insurer, retailer) will build and open-source an AI coding tool that gains >10K GitHub stars
CONTRARIAN Market 2029
By 2029, at least one non-tech Fortune 100 company (bank, insurer, retailer) will build and open-source an AI coding tool that gains >10K GitHub stars
Reasoning
Large enterprises have the engineering talent and the domain-specific codebases to train specialized coding models. However, the organizational incentive to open-source such tools is weak, and most enterprises prefer to buy rather than build developer tools.
Resolution criteria
Resolves YES if a Fortune 100 company outside the tech sector releases an open-source AI coding tool or model that achieves >10K GitHub stars by end of 2029.
76% GPT-5 or its successor will achieve >95% on HumanEval and >80% on SWE-bench Verified within 6 months of release
CONSENSUS AI Capability 2027
GPT-5 or its successor will achieve >95% on HumanEval and >80% on SWE-bench Verified within 6 months of release
Reasoning
Each GPT generation has shown massive jumps on coding benchmarks. GPT-4 already scored high on HumanEval, and the trend line suggests the next frontier model will saturate simpler benchmarks while making strong gains on harder ones.
Resolution criteria
Resolves YES if the next major OpenAI model after GPT-4o (whatever it is named) achieves >95% on HumanEval and >80% on SWE-bench Verified within 6 months of public release.
26% By 2028, AI will be able to autonomously maintain a medium-complexity open-source project (>10K LoC, >100 issues/year) with minimal human intervention
CONTRARIAN AI Capability 2028
By 2028, AI will be able to autonomously maintain a medium-complexity open-source project (>10K LoC, >100 issues/year) with minimal human intervention
Reasoning
Autonomous project maintenance requires understanding user intent from issue reports, navigating complex codebases, avoiding regressions, and making design decisions. Current agents can fix isolated bugs but struggle with the holistic judgment that maintenance demands.
Latest evidence (LR: 0.70)
Mounting developer fatigue and reality checks on autonomous coding agents reduce near-term prospects for autonomous maintenance
Resolution criteria
Resolves YES if an AI system autonomously resolves >70% of issues (bug fixes, feature requests, dependency updates) on a public GitHub project with >10K LoC and >100 issues/year, sustained for >6 months, with <10% of its PRs reverted, by end of 2028.
33% +4 Multimodal AI coding tools that accept screenshots, diagrams, and voice input will capture >25% of the AI coding tool market by 2029
NEAR-TERM AI Capability 2029
Multimodal AI coding tools that accept screenshots, diagrams, and voice input will capture >25% of the AI coding tool market by 2029
Reasoning
Multimodal input is a natural evolution for coding tools — designers can screenshot UIs, PMs can sketch architectures, and developers can voice-describe changes. But text-based interaction has deep inertia in developer workflows, and multimodal adds latency.
Latest evidence (LR: 2.00)
The release of Pluck directly integrates UI structures into AI coding workflows, validating the shift toward multimodal inputs.
Resolution criteria
Resolves YES if AI coding tools with multimodal input (beyond text/code) capture >25% market share by revenue or MAU per a credible analyst report by end of 2029.
94% +2 The cost per token for frontier coding models will drop by >90% between 2025 and 2028
CONSENSUS AI Capability 2028 CONTESTED
The cost per token for frontier coding models will drop by >90% between 2025 and 2028
Reasoning
API pricing has already fallen dramatically (GPT-4 pricing dropped >10x in 18 months). Competition between providers, hardware improvements, distillation techniques, and efficiency gains virtually guarantee continued steep price declines.
Latest evidence (LR: 3.00)
The sllm platform drops effective access costs for frontier models to $5/month, signaling a massive reduction in token costs.
Resolution criteria
Resolves YES if the per-token API price for a frontier-quality coding model (comparable to best-in-class 2025 performance) is <10% of the January 2025 price for GPT-4 Turbo or Claude 3.5 Sonnet by end of 2028.
73% The EU will pass binding regulation requiring disclosure of AI-generated code in safety-critical software (aviation, medical devices, autonomous vehicles) by 2029
CONSENSUS Regulation 2029
The EU will pass binding regulation requiring disclosure of AI-generated code in safety-critical software (aviation, medical devices, autonomous vehicles) by 2029
Reasoning
The EU AI Act already establishes risk categories for AI systems. Extending disclosure requirements to AI-generated code in high-risk domains is a natural regulatory evolution. The EU has consistently led on tech regulation.
Resolution criteria
Resolves YES if the EU adopts binding regulation (not just guidance) requiring organizations to disclose or document use of AI-generated code in safety-critical software by end of 2029.
15% The EU's AI coding regulations will cause >20% of European startups to relocate or incorporate engineering teams outside the EU by 2030
CONTRARIAN Regulation 2030
The EU's AI coding regulations will cause >20% of European startups to relocate or incorporate engineering teams outside the EU by 2030
Reasoning
Regulatory burden may drive some startups away, but the EU market is too large to abandon. Most companies will comply rather than relocate. The 20% threshold is extremely high — even GDPR didn't cause that level of flight.
Resolution criteria
Resolves YES if credible reporting or startup surveys show >20% of EU-founded tech startups have moved primary engineering operations outside the EU specifically citing AI code regulations by end of 2030.
28% By 2030, India will have a domestically-developed AI coding tool in the global top 10 by revenue
CONTRARIAN Geopolitics 2030
By 2030, India will have a domestically-developed AI coding tool in the global top 10 by revenue
Reasoning
India has a massive developer base and growing AI talent, but its AI ecosystem lags the US and China in foundation model development. Government initiatives and the domestic market size provide opportunity, but building a globally competitive tool from India remains challenging.
Resolution criteria
Resolves YES if an AI coding tool developed by an India-headquartered company ranks in the global top 10 by revenue per credible analyst report by end of 2030.
54% US-China AI export controls will fragment the AI coding tool market into two incompatible ecosystems by 2029
NEAR-TERM Geopolitics 2029
US-China AI export controls will fragment the AI coding tool market into two incompatible ecosystems by 2029
Reasoning
Export controls are tightening, and China is building independent AI infrastructure. But coding tools are software, not hardware — fragmentation requires active blocking of tool access, not just chip restrictions. VPNs and cloud routing may prevent full fragmentation.
Latest evidence (LR: 1.50)
Non-US models matching frontier capabilities and hardware bypasses indicate a fracturing of the global AI ecosystem.
Resolution criteria
Resolves YES if by end of 2029, the top 3 AI coding tools used in China share zero overlap with the top 3 used in the US, and developers cannot easily use tools from the other ecosystem, per usage data or developer surveys.
58% By end of 2026, at least 3 Y Combinator batch companies will ship products built entirely by AI coding agents with <100 lines of human-written code
NEAR-TERM Market 2026
By end of 2026, at least 3 Y Combinator batch companies will ship products built entirely by AI coding agents with <100 lines of human-written code
Reasoning
YC is already seeing founders who use AI agents for nearly all coding. The barrier is not capability but willingness to trust AI-generated code for a production startup. Some founders will embrace this fully as a competitive advantage in speed.
Latest evidence (LR: 0.80)
Reality checks on autonomous coding capabilities make products built entirely by AI agents less likely in the near term
Resolution criteria
Resolves YES if 3+ companies from YC batches in 2026 publicly state or are credibly reported to have built their initial product with <100 lines of human-authored code, the rest being AI-generated.
10% By 2028, 'AI debugging' specialists — engineers who primarily diagnose and fix AI-generated code — will be a recognized job category at >10 major tech companies
CONTRARIAN Labor 2028
By 2028, 'AI debugging' specialists — engineers who primarily diagnose and fix AI-generated code — will be a recognized job category at >10 major tech companies
Reasoning
While AI-generated code does need debugging, it is unlikely to become a standalone job category. Debugging AI code requires the same skills as debugging human code. Companies are more likely to fold this into existing SRE or quality engineering roles than create a new category.
Latest evidence (LR: 1.40)
New tools designed to monitor, filter, and debug multi-agent AI teams point to a new specialization in AI debugging
Resolution criteria
Resolves YES if >10 companies with >5000 employees have a distinct job title or team specifically for diagnosing/fixing AI-generated code issues, per job postings or company org disclosures, by end of 2028.
77% At least one AI coding tool will achieve >80% on SWE-bench Verified by April 30, 2026
CONSENSUS AI Capability 2026-04
At least one AI coding tool will achieve >80% on SWE-bench Verified by April 30, 2026
Reasoning
Current best is ~72% as of March 2026. Rapid progress trajectory but the 80% threshold is a significant jump from 72%.
Latest evidence (LR: 1.30)
Meta's repoprover provides a pathway for AI models to deterministically verify code, accelerating their path to achieving high benchmarks on SWE-bench Verified.
Resolution criteria
Resolves YES if any AI system scores >80% on the SWE-bench Verified leaderboard by April 30, 2026.
66% Anthropic will release a new Claude model (Claude 4 or equivalent) before May 15, 2026
CONSENSUS AI Capability 2026-05
Anthropic will release a new Claude model (Claude 4 or equivalent) before May 15, 2026
Reasoning
Anthropic has been on roughly quarterly release cadence. Claude Opus 4.6 launched early 2026. A new frontier model seems likely within this window.
Latest evidence (LR: 1.50)
The leak of 3,000 files from Anthropic's 'Mythos' project hints at major impending architectural developments preceding a new model launch.
Resolution criteria
Resolves YES if Anthropic publicly launches a model described as a major new release (not a minor update) before May 15, 2026.
74% GitHub will announce changes to Copilot pricing or tiers before June 1, 2026
CONSENSUS Market 2026-06
GitHub will announce changes to Copilot pricing or tiers before June 1, 2026
Reasoning
GitHub has already moved to metered billing. Competition from Cursor and Claude Code is intensifying. More pricing changes are likely.
Latest evidence (LR: 1.80)
The $65 compute cost for $20 subscriptions points to an urgent need for aggressive price hikes in tools like Copilot.
Resolution criteria
Resolves YES if GitHub announces any change to Copilot pricing, plans, or billing structure before June 1, 2026.
46% At least 3 new AI coding startups will announce >0M funding rounds in April 2026
CONSENSUS Market 2026-04
At least 3 new AI coding startups will announce >0M funding rounds in April 2026
Reasoning
AI coding is the hottest VC category. Multiple startups are reportedly in late-stage fundraising. But >0M for 3+ in a single month is a high bar.
Latest evidence (LR: 0.60)
Drying venture subsidies and a violent market correction make near-term >$10M funding announcements less likely.
Resolution criteria
Resolves YES if 3+ AI coding tool startups announce 0M+ funding rounds in April 2026, per Crunchbase or TechCrunch reports.
65% The next major arXiv paper on AI code generation will report >75% on HumanEval+ before April 20, 2026
CONSENSUS AI Capability 2026-04
The next major arXiv paper on AI code generation will report >75% on HumanEval+ before April 20, 2026
Reasoning
The benchmark pace has been accelerating. Multiple labs are working on code models. A >75% HumanEval+ result seems achievable.
Resolution criteria
Resolves YES if an arXiv paper published before April 20 reports >75% on HumanEval+ (extended version) for a single model.
How prediction scoring works
Each prediction carries a probability estimate updated daily through a structured process:
- Signals are scraped from 15+ sources and filtered by an LLM for relevance
- Targeted search queries (via Tavily) find prediction-specific evidence
- An LLM estimates likelihood ratios: P(evidence | claim true) / P(evidence | claim false)
- Bayesian log-odds update with 0.3 dampening factor (prevents wild swings)
- 5 forecaster personas vote independently; median likelihood ratio applied as secondary adjustment
- Maximum daily movement: 5 percentage points. Bounds: 3% to 97%.