Category Archives: Industry & Policy

For anyone tracking where response integrity is headed: market research industry shifts, election security policy, blockchain regulation, federal and state RFP activity, and the emerging category of verified-integrity infrastructure. Articles here read the room for buyers and operators, with commentary on news, standards, and the players defining the space. Lighter on product, heavier on point of view.

Data Integrity Language for RFPs and Proposals: Copy, Paste, Win

Free clause language for research buyers and firms. Raise the bar, then clear it.

Why this exists

Data integrity is showing up in research RFPs, but the language is usually vague: “describe your data quality procedures.” Vague requirements get vague answers, and vague answers are how fraudulent data keeps winning. This post gives both sides better words.

If you are a research buyer, the clauses below put real teeth in your next RFP. If you are a research firm, the proposal language below turns data integrity from a compliance paragraph into a competitive weapon.

Use any of it freely. Adapt to your counsel’s taste; this is practical language, not legal advice.

For buyers: RFP requirements that actually filter vendors

Respondent identity verification “Vendor shall describe its method for verifying the identity of survey respondents prior to participation, including what percentage of respondents undergo identity verification and by what mechanism. Reliance on panel self declaration alone does not satisfy this requirement.”

Fraud screening at submission “Vendor shall describe automated fraud controls applied at the point of response submission, including bot detection and validation of respondent data against independent third party databases. Vendor shall report the rejection rate at submission for comparable studies.”

Tamper evident data record “Vendor shall maintain an independent, tamper evident record of the completed dataset, created at or immediately following data collection, sufficient to demonstrate that delivered data has not been altered. Vendor shall describe how the buyer can independently verify this record.”

Audit and transparency “Upon request, vendor shall provide an audit trail covering respondent verification status, submission screening results, and dataset integrity verification, within five business days.”

Ask these four questions and watch the field narrow. Most vendors can answer the second. Very few can answer the first and third.

For research firms: proposal language that wins the integrity question

Short version (for a capabilities matrix): “All respondents are KYC identity verified prior to participation. Submissions are screened in real time via CAPTCHA and validation against an 80M+ national residential address database. Completed datasets are recorded immutably on the XRP Ledger immediately following collection, providing a tamper evident, independently auditable record of data integrity.”

Long version (for a methodology section): “Our data integrity model operates in three layers rather than relying on post hoc cleaning. First, respondent identity: participants complete KYC identity verification before answering, ensuring responses originate from verified, real individuals rather than unverified panel identities. Second, submission screening: each response passes automated fraud controls at the moment of submission, including bot detection and validation against an independent national address database covering more than 80 million records. Third, integrity of record: upon submission, the completed responses are written to the XRP Ledger as an immutable record, meaning the delivered dataset can be independently verified as unaltered from the point of collection. The result is a dataset whose integrity we do not merely assert, but can demonstrate.”

Note the construction: identity before, screening at, record after. Keeping the sequence precise is part of the credibility. The ledger record proves the data was never altered; it is created after submission and is not itself the identity check.

The strategic point

Whoever introduces integrity language into a deal controls the evaluation. If you are a firm and your RFP response includes requirements your competitors cannot meet, you have changed the question from “who is cheapest” to “who can prove their data.” That is a question you want.

If meeting this language is the gap, that is what we build. BallotHut provides the KYC verification, submission screening, and XRPL integrity record as a layer on your studies, and a paid single study pilot is the fastest way to test it on real work. Details at ballothut.com.

Tracy Wehringer CMO, BallotHut

How to Prove Your Survey Data Is Clean to a Skeptical Client

Detection protects your dataset. Proof protects your client relationship. They are not the same thing.

The question that ends client relationships

It rarely arrives as an accusation. It arrives as a polite question in a readout: “How confident are we in this sample?” Or a procurement line: “Describe your data integrity controls.” Or worst, a quiet one: your client’s stakeholder saw the headlines that a third of survey responses are now fraudulent, and now every surprising finding in your report carries an asterisk.

Here is the uncomfortable position most research firms are in: even when the data is clean, they cannot prove it. Internal cleaning logs are not proof; they are your own homework, graded by you. “We removed 12% of responses in QA” does not reassure a client. It tells them 12% of what you fielded was bad, and invites the obvious follow up: how do you know you caught the rest?

What counts as proof, and what does not

Does not count as proof:

  • Your panel provider’s quality claims. That is their assertion, passed through you.
  • Attention checks and trap questions passed. Modern AI generated responses pass these.
  • Internal cleaning documentation. Editable by you, therefore not independent.
  • A confident tone in the readout.

Counts as proof:

  • Evidence the respondent was a verified, real person, established before they answered.
  • Evidence the response was screened at submission against independent data, not just your own rules.
  • An independent, tamper evident record showing the dataset has not been altered since collection, verifiable by someone other than you.

The pattern: proof is independent, and it exists at every stage, not just at cleanup.

The three questions your client is really asking

1. “Were these real people?” Answer it with identity, not inference. KYC verification before a respondent answers means you can say “every response in this dataset came from a verified identity,” which is a categorically different sentence than “we screen for bots.” The research industry’s open secret is that most vendors cannot verify everyone; saying you can, and showing it, separates you immediately.

2. “Did anything fake get through?” Answer it with screening at the gate: CAPTCHA plus validation against an independent national address database at the moment of submission. Fraud stopped at entry never needs to be found in cleaning, and your cleaning rate becomes a small number you are happy to share.

3. “Has the data been touched since?” Answer it with an immutable record. When the completed responses are written to the XRP Ledger after submission, the dataset becomes tamper evident: anyone, including your client’s own analyst, can confirm it matches what was collected. You are no longer asking to be trusted; you are handing over the means to verify.

How to put this in front of a client

A simple integrity statement at the front of the report, three lines:

  1. All responses in this study came from KYC verified identities.
  2. Submissions were screened in real time against [controls], with a [X]% rejection rate at the gate.
  3. The completed dataset is recorded immutably on the XRP Ledger and is independently auditable; the verification reference is available on request.

Then watch what happens in the room. The data quality conversation, which used to be a defensive moment, becomes a selling moment. You are the firm that proves it.

Try it on one study

The fastest way to experience the difference is a paid pilot on a single project: your study, fielded with all three layers on, with a before and after you can show your client. Details at ballothut.com.

Tracy Wehringer CMO, BallotHut

Survey Fraud in 2026

How AI Broke Survey Research, And What to Do About it.

by Tracy A. Wehringer, CMO

Executive Summary

Survey research is facing an existential crisis. The same AI technologies transforming business are simultaneously destroying the integrity of survey data. In 2026, nearly one-third of all survey responses are fraudulent, and traditional fraud detection methods catch almost none of them.

This report examines the scope of the AI survey fraud epidemic, explains why conventional defenses have failed, and outlines the emerging solutions that can restore trust in survey data.

Key Findings

  • 31% of raw survey responses now contain fraud, up from an estimated 10-15% in 2022
  • As few as 10-52 fake responses can flip poll results in political and market research
  • 38% of collected survey data is now discarded due to suspected fraud, wasting research budgets
  • 57% of document fraud is now AI-generated, a 244% year-over-year increase

The Bottom Line: Traditional fraud detection methods were designed for human bad actors. They are fundamentally incapable of stopping AI-powered fraud. A new approach is required: verifying respondent identity after survey completion.

The Scale of the Problem

Survey fraud is not new. Researchers have battled fraudulent responses for decades. But generative AI has fundamentally changed the economics and sophistication of fraud, transforming a manageable nuisance into a crisis threatening the validity of all survey-based research.

This figure represents a dramatic acceleration. Industry estimates from 2020-2022 placed fraud rates at 10-15% of responses. The introduction of ChatGPT in late 2022 and subsequent large language models created an inflection point, enabling fraud at scale with unprecedented sophistication.

The Academic Research Crisis

Academic researchers have been hit particularly hard. A 2023 study published in Wiley’s Applied Economic Perspectives and Policy documented an extreme case: 96% of responses to an online survey were identified as fraudulent.

The researchers noted that fraudulent respondents had become indistinguishable from legitimate participants using traditional screening methods. Standard academic practices, university email verification, attention checks, and response time analysis, proved ineffective against AI-powered fraud.

Market Research Under Siege

The market research industry, valued at over $80 billion globally, faces a credibility crisis. According to IPQS (IP Quality Score), 20% of market research data submitted to clients contains fraudulent responses, responses that passed all quality checks before delivery.

This has cascading effects across industries:

  • Companies making product decisions based on corrupted data
  • Political campaigns misreading voter sentiment
  • Healthcare organizations drawing incorrect conclusions about patient experiences
  • HR departments making policy changes based on fraudulent employee feedback

Why Traditional Defenses Fail

The survey industry has relied on a standard toolkit of fraud prevention measures for over a decade. In the age of generative AI, every single one of these defenses has been neutralized.

The 99.8% Problem

A landmark 2025 study from Dartmouth College tested AI bots against standard survey quality measures. The results were devastating for the industry:

The bots successfully defeated security checks and exhibited human-like response timing. Traditional quality indicators, straight-lining detection, speeder flags, gibberish filters, were essentially useless.

Defense-by-Defense Breakdown

Response Time Analysis

Bot operators have adapted. Modern survey bots incorporate randomized delays, simulate reading time proportional to question length, and even mimic human patterns like slower responses on complex questions. The Dartmouth study found bot response patterns statistically identical to human participants.

Email and IP Verification

Fraudsters operate with thousands of unique email addresses and rotate through residential IP pools. A single bot operator can present as thousands of distinct individuals across different geographic locations.

Fraud Defense Effectiveness: Pre-AI vs. Post-AI

The Economics of AI Survey Fraud

Understanding why fraud has exploded requires examining the economic incentives. AI has fundamentally altered the cost-benefit calculation, making survey fraud extraordinarily profitable.

The $0.05 vs. $1.50 Gap

The Dartmouth research documented the core economic driver:

Human RespondentAI Bot Operator
$1.50 average payout$0.05 cost per response
10-15 minutes per survey30 seconds per survey
Limited daily capacityThousands daily
Geographic constraintsGlobal operation

A single bot operator running automated scripts can complete thousands of surveys daily. At a 30x profit margin per response and near-zero marginal cost to scale, the economic incentive is overwhelming.

The Fraud Industry Infrastructure

Survey fraud has evolved from individual bad actors to an organized industry with sophisticated infrastructure:

  • Bot-as-a-Service platforms offering survey completion at scale
  • Residential proxy networks masking bot traffic as legitimate users
  • AI model fine-tuning specifically for survey response generation
  • Identity farms providing unique email/phone combinations

Real-World Consequences

Survey fraud is not an abstract data quality issue. Corrupted survey data leads to real-world decisions with significant consequences.

Political Polling Manipulation

The Dartmouth study demonstrated that remarkably small numbers of fraudulent responses can alter research outcomes:

In close elections or contested policy debates, this represents a serious vulnerability. Bad actors can influence perceived public opinion at minimal cost, potentially affecting media coverage, campaign strategy, and policy decisions.

Business Intelligence Failures

Companies relying on customer feedback surveys, employee engagement studies, and market research face corrupted intelligence:

  • Product teams launching features based on fake user preferences
  • HR departments misreading employee sentiment and engagement
  • Marketing campaigns targeting phantom customer segments
  • Executive decisions based on fundamentally flawed data
  • M&A due diligence compromised by unreliable market research

The Hidden Cost: Data Waste

Research organizations have responded to the fraud epidemic by discarding suspicious data; often far more than necessary due to inability to distinguish real from fake:

This represents massive waste: research budgets hemorrhaging value, extended project timelines, and reduced sample sizes that compromise statistical validity.

The True Cost of Survey Fraud (per $100K research project): 
• Data collection cost: $100,000
• Usable data after fraud screening: 62% ($62,000 value)
• Discarded data: 38% ($38,000 wasted)
• Additional collection to reach target n: +$15,000-25,000
• Extended timeline: 2-4 weeks delay Total impact: 40-60% budget inefficiency

The Broader AI Fraud Context

Survey fraud exists within a larger epidemic of AI-generated deception affecting all forms of digital verification and authentication.

Document Fraud Explosion

The Entrust Cybersecurity Institute’s 2025 Identity Fraud Report documented the acceleration:

Deepfakes, synthetic identities, and AI-generated documents have moved from theoretical concerns to operational realities. Organizations across sectors are confronting the same fundamental challenge: how do you verify that a human is real?

The Rise of Proof-of-Humanity

This crisis has spawned a new category of solutions focused on verifying human identity in digital interactions. The market validation is significant:

Market IndicatorValue/Projection
Humanity Protocol Valuation$1.1B (Jan 2025)
Humanity Protocol Funding$50M raised
Digital ID Verification Market (2034)$27B+
Decentralized Identity Market (2035)$620B+
Enterprises Considering Blockchain for ID65%
Digital ID Apps Projected (2030)6.2 billion

The Path Forward

The survey industry cannot defend against AI fraud using pre-AI tools. A new approach is required—one that verifies respondent authenticity at a fundamental level rather than attempting to detect fraud after the fact.

From Fraud Detection to Identity Verification

The paradigm shift required is moving from probabilistic fraud detection to deterministic identity verification:

Old Model: Fraud DetectionNew Model: Identity Verification
Responses collectedVerified survey deployed
Fraud analysis (probabilistic)Responses collected
Maybe valid dataDefinitively valid data

The critical insight: verification must happen before or during survey completion—not after. Once a fraudulent response is in your dataset, you’re guessing about which responses to keep.

Key Components of Next-Generation Verification

Address-Based Identity

Verifying that a respondent exists at a real physical address provides a foundation of authenticity that AI cannot easily fabricate. When combined with a comprehensive address database, this creates a verification layer tied to the physical world. An AI bot cannot claim to live at 123 Main Street if that address can be validated against 80M+ verified residential addresses.

Blockchain-Backed Verification

Immutable verification records prevent tampering and create auditable proof of respondent authenticity. Each verified response can be traced to its verification event, providing defensible data integrity.

Geographic Intelligence

Beyond simple verification, understanding the geographic distribution of responses provides additional validity signals. For government and community surveys, this enables verification that respondents actually live in the jurisdiction they’re providing feedback about.

Conclusion

The survey industry stands at a crossroads. The AI technologies that have revolutionized countless industries have simultaneously undermined the foundational assumption of survey research: that responses come from real humans sharing genuine opinions.

The statistics are stark:

  • 31% fraud rates in raw survey data
  • 99.8% of AI bots passing traditional quality checks
  • 38% of research budgets wasted on unusable data
  • 10-52 fake responses sufficient to flip poll results

Traditional defenses have failed. The gap between fraud sophistication and detection capability widens daily. Organizations continuing to rely on attention checks, and response time analysis are not protecting their data, they’re providing themselves false confidence while fraud passes undetected.

But this crisis also presents an opportunity. Organizations that adopt next-generation verification, moving from fraud detection to identity verification, will possess a significant competitive advantage: data they can actually trust.

The question is no longer whether to address survey fraud, but how quickly organizations can implement solutions before the credibility of their research is irreparably compromised. The future of survey research belongs to organizations that can prove their data comes from verified humans, not those hoping their fraud filters catch what AI throws at them.

References

Survey Fraud Research

Research Defender. (2024). Survey fraud detection benchmarks.

Dartmouth College. (2025). AI bots and survey quality: A comprehensive analysis. Study Finds.

Kennedy, A., Barkley, B., et al. (2023). Battling bots: Experiences and strategies to mitigate fraudulent responses in online surveys. Applied Economic Perspectives and Policy, Wiley.

IPQS – IP Quality Score. (2024). Market research fraud analysis.

MRS – Market Research Society. (2023). AI-generated response trends in market research.

Greenbook. (2024). Online Survey Frauds in Market Research: Challenges and Solutions.

CHEQ. (2024). Survey Bots: How They Manipulate Data and Skew Results.

Identity Verification & Fraud

Entrust Cybersecurity Institute. (2025). Identity fraud report: AI-generated document fraud trends.

Humanity Protocol. (2025). Series A funding announcement. CoinDesk.

Fortune Business Insights. (2025). Digital identity verification market forecast 2025-2034.

Biometric Update. (2025). Humanity Protocol raises $20M at $1.1B valuation.

Market Data

Mordor Intelligence. (2025). Survey software market analysis.

GM Insights. (2025). Decentralized identity market size and forecast.

Polaris Market Research. (2025). Blockchain identity verification market trends.

Market Research Future. (2024). Online Survey Software Market Size, Share, Report, Forecast 2035.

About the Author

Tracy A. Wehringer, MBA serves as the fractional Chief Marketing Officer at BallotHut. With extensive C-suite advisory experience including Global 500 clients, Tracy brings deep expertise in revenue marketing, go-to-market strategy, and emerging technology positioning.

About BallotHut

BallotHut is a Proof of Human Response platform that verifies survey respondents are real people at real addresses using blockchain-backed authentication.Built on XRPL technology with access to an 80M+ National Address Database, BallotHut integrates with existing survey platforms to provide the verification layer organizations need to trust their data. ballothut.com

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

For media inquiries: tracy@ballothut.com

12 Signs Your Survey Dataset Has a Fraud Problem

A quick diagnostic checklist for research and insights teams. If three or more sound familiar, keep reading.

Fraud rarely announces itself. With roughly a third of survey attempts now fraudulent and AI generated answers passing standard quality checks, the signs are subtle, statistical, and easy to rationalize away. Here are twelve worth taking seriously.

1. Your open ends got better. Suspiciously better. Fluent, on topic, well-structured answers at scale are now more likely to be a language model than an unusually articulate panel. The old tells (gibberish, copy paste) are gone; eloquence is the new red flag.

2. Completion times cluster too tightly. Real humans are messy: some race, some wander off and return. When a large share of completes land in a narrow time band, automation or scripted farms are the likelier explanation.

3. Straight lining has gotten smarter. Instead of all 5s, you see plausible variation that never quite contradicts itself. Sophisticated fraud mimics attentiveness; check whether grid answers correlate too perfectly with each other.

4. Incidence rates do not match reality. When 30% of your general population sample claims to own a boat, manage enterprise IT budgets, or have a rare condition, fraudsters are qualifying into your highest paying screeners.

5. Demographics shift between waves. A tracker whose respondent profile drifts wave to wave without a real world reason often means the fraudulent share of your panel is changing underneath you.

6. Geography and IP do not line up. Respondents claiming one location while submitting from another, or clusters of completes from data center IP ranges, point to farms and proxies.

7. Your cleaning rate keeps creeping up. If you removed 5% of completes two years ago and remove 15% now, the question is not whether fraud is rising; it is how much is still getting through, since cleanup only catches what your rules can see.

8. Trap questions stopped trapping. When attention check failure rates fall while everything else looks worse, the fraud has learned your checks. AI assisted respondents pass traps designed for careless humans.

9. Surprising findings keep failing to replicate. Fraud injects noise that masquerades as insight. If your interesting subgroup differences evaporate on re fielding, contamination is a prime suspect.

10. The same “person” keeps coming back. Matching response patterns, device fingerprints, or open end phrasing across supposedly different respondents means duplicates or a persona farm.

11. Your panel provider cannot answer the identity question. Ask directly: what share of these respondents passed identity verification, and by what method? If the answer is a quality score rather than a verification method, identity is unverified.

12. A client asked, and you got defensive. The clearest sign of all. If “how do we know these are real people” produces discomfort instead of a document, your process has a proof gap regardless of how clean the data actually is.

What to do with your count

0 to 2 signs: Stay vigilant; your exposure is likely moderate. Your next move is proof: being able to demonstrate integrity, not just maintain it.

3 to 5 signs: You have a live problem. Take our Survey Fraud Risk Scorecard to locate exactly where fraud is entering: at identity, at submission, or after.

6 or more: Your datasets are materially contaminated, and cleaning alone will not fix it, because cleaning only catches what your rules already know to look for. The fix is structural: verify identity before anyone answers, screen at submission, and keep a tamper evident record after.

That three layer structure is what BallotHut does: KYC verified identities, CAPTCHA plus 80M+ address database screening at submission, and an immutable post submission record on the XRP Ledger. The fastest way to see the difference is a paid pilot on one of your own studies.

Tracy Wehringer CMO, BallotHut