IRS’s AI system to flag returns for audit may include unintended bias, report finds

Following a report identifying racial disparities in audit selection, the GAO says the tax agency hasn’t conducted a “comprehensive review” of the rules and filters in its Dependent Database.

By Matt Bracken

May 23, 2024

A view of the IRS headquarters. (Photo by Karen Bleier/AFP via Getty Images)

The IRS’s primary tool for flagging tax returns for audit is a “first-wave” AI system that includes inputs from humans, according to a new watchdog report, opening the door for unintended bias at a time when the agency is attempting to combat racial disparities in auditing.

The Government Accountability Office found no evidence that the tax agency has conducted a “comprehensive review of the rules and filters contained” in its Dependent Database, an automated program that identifies returns with possible noncompliance risk. The GAO considers the DDB first-wave AI because it has “expert knowledge encoded into a computer system.”

“While IRS regularly reviews the program, the review process does not comprehensively consider data inputs and assumptions that could inform IRS about the demographic equity of the audit selection process, creating the potential for unintended bias in audit selection,” the report stated. “For example, GAO found that some risk scores contained in the DDB program vary by sex, which could skew selection, and have not been updated since 2001.”

A 2023 Stanford University study found that Black taxpayers are roughly three to five times more likely to be audited than filers of other races. The IRS later confirmed the study’s findings, with Commissioner Danny Werfel writing in a letter to Congress that the agency would be “laser-focused” on addressing racial disparities in auditing.

The GAO noted that the tax agency does not collect data about taxpayers’ race and ethnicity, meaning that predictions about a return’s risk for noncompliance with tax codes don’t take either factor into account. But according to the GAO, IRS research still shows “the existence of racial disparities in audits,” with “unintentional algorithmic biases” identified as a possible source.

“Specifically, that research noted (1) limitations in the data used to determine residency and relationship tests for [Earned Income Tax Credit] eligibility, and (2) outdated models as possible contributions to algorithmic bias and, consequently, racial disparities in audits,” the report states.

Once a return is flagged by the DDB program, it is then evaluated by the agency’s Systems Research and Application (SRA) model, which determines the filer’s risk score. Considered second-wave AI, the SRA is a data-mining and machine-learning model that the IRS uses to pinpoint audit patterns and predict outcomes.

The GAO identified “some components” of the IRS Wage & Investment Division’s “automated audit selection process that could potentially skew selection toward returns with certain demographic characteristics that may not necessarily represent returns with the highest risk of noncompliance.” The SRA ranks risk scores from highest to lowest, and W&I starts with the highest until meeting “its predetermined audit workload,” the watchdog noted.
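As a rough illustration of the ranking step the GAO describes, the sketch below sorts flagged returns by risk score and takes the highest-scoring ones until a fixed workload is met. It also computes a selection rate per group, one example of the kind of additional measure a review might look at. This is not the IRS's code: the names `FlaggedReturn`, `select_for_audit`, `selection_rate_by_group`, the `group_of` mapping, and all the scores are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class FlaggedReturn:
    return_id: str     # hypothetical identifier for a flagged return
    risk_score: float  # hypothetical score; higher means higher assumed risk


def select_for_audit(flagged: List[FlaggedReturn], workload: int) -> List[FlaggedReturn]:
    """Rank returns from highest to lowest score and keep the top `workload`."""
    if workload < 0:
        raise ValueError("workload must be non-negative")
    ranked = sorted(flagged, key=lambda r: r.risk_score, reverse=True)
    return ranked[:workload]


def selection_rate_by_group(
    flagged: List[FlaggedReturn],
    selected: List[FlaggedReturn],
    group_of: Dict[str, str],
) -> Dict[str, float]:
    """Share of flagged returns in each group that ended up selected.

    `group_of` maps return_id to a group label. It is an assumption of this
    sketch; the article notes the IRS does not collect race or ethnicity data.
    """
    selected_ids = {r.return_id for r in selected}
    totals: Dict[str, int] = {}
    picked: Dict[str, int] = {}
    for r in flagged:
        group = group_of[r.return_id]
        totals[group] = totals.get(group, 0) + 1
        if r.return_id in selected_ids:
            picked[group] = picked.get(group, 0) + 1
    return {group: picked.get(group, 0) / totals[group] for group in totals}


if __name__ == "__main__":
    # Made-up data, for illustration only.
    flagged = [
        FlaggedReturn("A", 0.2),
        FlaggedReturn("B", 0.9),
        FlaggedReturn("C", 0.5),
        FlaggedReturn("D", 0.1),
    ]
    group_of = {"A": "x", "B": "x", "C": "x", "D": "y"}
    selected = select_for_audit(flagged, workload=2)
    print([r.return_id for r in selected])
    print(selection_rate_by_group(flagged, selected, group_of))
```

Because selection depends only on score order and the workload cutoff, any skew in how scores are assigned carries straight through to who is audited, which is why comparing selection rates across groups can be informative.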

The GAO pushed the IRS to abide by its AI accountability framework, particularly with regard to “a variety of monitoring activities” that should be followed “to ensure AI systems function as intended.”

“The agency may be missing opportunities to improve the likelihood that IRS is properly identifying returns at highest risk of noncompliance if it does not consider additional performance measures in reviewing its automated audit selection process,” the report said.

The GAO delivered six recommendations to the IRS regarding its audit selection processes, all of which were agreed to by the agency.
