National AI Research Resource must balance the value of its data with privacy

The large number of parties expected to have access increases the risk data could be used to triangulate personally identifiable information.

The task force developing recommendations on a National Artificial Intelligence Research Resource must balance the need to provide valuable data with the increased risk it could be used to triangulate personally identifiable information, given the large number of parties expected to have access, experts say.

Task force members want to include startups and small businesses developing privacy technologies among NAIRR's users, but exactly how resources, capabilities and policies would be integrated continues to be discussed, according to co-chair Manish Parashar.

Members previously stated that U.S.-based researchers and students — primarily in academia but also with companies that have received federal grants like Small Business Innovation Research or Small Business Technology Transfer funding — are target users of the NAIRR. Privacy technologies they’re developing could help the resource protect personally identifiable information (PII).

“Yes, the task force is certainly discussing how privacy-enabling technologies could help enhance the privacy aspects of NAIRR usage,” Parashar told FedScoop. “However, the task force has also discussed how privacy requires more than just technical solutions, and we expect a full range of considerations when contemplating privacy, civil rights and civil liberties.”

Data used to train machine learning (ML) algorithms can be anonymized to a degree, but anonymization is never absolute: with enough effort, PII can be re-identified by correlating the remaining attributes with other datasets.
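As a rough illustration of why, consider a linkage attack, the classic re-identification technique. The sketch below uses entirely invented records and field names; it is not drawn from any NAIRR dataset.

```python
# Hypothetical illustration only: all names, records and fields are
# invented. Even with direct identifiers removed, quasi-identifiers
# such as ZIP code, birth date and sex can be joined against a public
# dataset to re-identify individuals (a "linkage attack").

# "Anonymized" research records: names stripped, quasi-identifiers kept.
anonymized = [
    {"zip": "20500", "birth_date": "1980-04-12", "sex": "F", "diagnosis": "asthma"},
    {"zip": "20037", "birth_date": "1975-09-30", "sex": "M", "diagnosis": "diabetes"},
]

# A separate public dataset (e.g. a voter roll) that does include names.
public = [
    {"name": "Alice Smith", "zip": "20500", "birth_date": "1980-04-12", "sex": "F"},
    {"name": "Bob Jones", "zip": "20037", "birth_date": "1975-09-30", "sex": "M"},
]

def reidentify(anon_rows, public_rows, keys=("zip", "birth_date", "sex")):
    """Join the two datasets on quasi-identifiers; return named matches."""
    index = {tuple(p[k] for k in keys): p["name"] for p in public_rows}
    matches = []
    for row in anon_rows:
        key = tuple(row[k] for k in keys)
        if key in index:
            matches.append({"name": index[key], **row})
    return matches

for match in reidentify(anonymized, public):
    print(match)  # each "anonymous" record is now tied to a name
```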

Startups like integrate.ai, which advocates privacy by design, see an opportunity for the NAIRR to not only include them but use their privacy-enhancing technologies: federated learning, differential privacy, homomorphic encryption and secure multi-party computation.
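To make one of those techniques concrete, here is a minimal sketch of differential privacy applied to a simple counting query. The records, predicate and epsilon value are illustrative choices, not anything specified by the task force or integrate.ai.

```python
import numpy as np

def dp_count(records, predicate, epsilon=1.0):
    """Differentially private count via the Laplace mechanism.

    A counting query changes by at most 1 when one person is added or
    removed (sensitivity 1), so Laplace noise with scale 1/epsilon
    hides any individual's presence in the data.
    """
    true_count = sum(1 for r in records if predicate(r))
    return true_count + np.random.laplace(loc=0.0, scale=1.0 / epsilon)

# Illustrative usage: an analyst learns an approximate count without
# learning whether any specific record is in the dataset.
records = [{"diagnosis": "asthma"}, {"diagnosis": "diabetes"}, {"diagnosis": "asthma"}]
print(dp_count(records, lambda r: r["diagnosis"] == "asthma", epsilon=0.5))
```

A smaller epsilon adds more noise, trading accuracy for stronger privacy.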

“I would love to see a privacy track, a privacy initiative that both leverages the research value of the resource but also supports the whole initiative to actually protect the privacy of that information,” said Karl Martin, senior vice president of technology at integrate.ai.

Martin envisions a cluster of researchers and companies with a mandate to both advance their own work and support the NAIRR with privacy-enhancing technologies that other users may adopt, or be required to adopt, to access the resource's data.

Database-style access controls, which limit what organizations can see based on data type, are the “most basic” form of privacy protection, and they would likely prove “frustrating” for NAIRR users, Martin said.
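A sketch of what that basic model looks like (the organization names and data types below are hypothetical): access is an all-or-nothing grant per data type.

```python
# Hypothetical sketch of database-style access control: each organization
# is granted specific data types, and anything outside the grant is
# refused outright -- the all-or-nothing model Martin calls "frustrating".
GRANTS = {
    "university-lab": {"imagery", "census"},
    "sbir-startup": {"imagery"},
}

def fetch(org, data_type, store):
    """Return records only if the organization holds a grant for that type."""
    if data_type not in GRANTS.get(org, set()):
        raise PermissionError(f"{org} is not cleared for {data_type} data")
    return store[data_type]

store = {"imagery": ["img-001", "img-002"], "census": ["tract-42"]}
print(fetch("sbir-startup", "imagery", store))  # allowed
# fetch("sbir-startup", "census", store)        # would raise PermissionError
```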

On the other hand, federated learning allows ML models to be trained without directly accessing the underlying data, and it can be compounded with additional layers of privacy, like differential privacy, that make reverse engineering back to the original data difficult, he added.
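A minimal sketch of that combination, assuming a toy linear model and made-up client data (the clipping and noise parameters are illustrative, not recommendations): each data holder trains locally and shares only a clipped, noised update, so raw records never leave the client.

```python
import numpy as np

def local_update(weights, X, y, lr=0.1):
    """Compute one local gradient step's update (delta) on private data."""
    grad = 2 * X.T @ (X @ weights - y) / len(y)
    return -lr * grad

def privatize(update, clip=1.0, noise_scale=0.1, rng=np.random.default_rng(0)):
    """Clip the update's norm, then add Gaussian noise before sharing."""
    norm = np.linalg.norm(update)
    clipped = update * min(1.0, clip / max(norm, 1e-12))
    return clipped + rng.normal(0.0, noise_scale, size=update.shape)

def federated_round(weights, clients):
    """The server averages privatized updates; it never sees raw data."""
    deltas = [privatize(local_update(weights, X, y)) for X, y in clients]
    return weights + np.mean(deltas, axis=0)

# Made-up clients, each holding its own private (X, y) data.
rng = np.random.default_rng(42)
clients = [(rng.normal(size=(20, 3)), rng.normal(size=20)) for _ in range(4)]

weights = np.zeros(3)
for _ in range(10):
    weights = federated_round(weights, clients)
print(weights)  # trained without any client ever sharing its records
```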

Whatever privacy technologies the task force ultimately recommends should follow a smart-data philosophy, favoring protections attached to the data itself rather than to the systems that hold it.

“What’s the value of this data?” Martin said. “Then what are the protection mechanisms that can surround the data?”