Executive Gov

DOW, ODNI Seek Proposals for AI Evaluation Harness & Benchmark Framework

by Miles Jamison
May 18, 2026
in Artificial Intelligence, Defense And Intelligence, News

The Department of War, in coordination with the Office of the Director of National Intelligence, is seeking industry proposals for an evaluation harness and government-defined benchmarks that would enable rigorous, reproducible and vendor-agnostic testing of artificial intelligence systems against criteria specified by the government.

Table of Contents

  • What Features Are Required in the Evaluation Harness?
  • What Standards Must the New Benchmarks Meet?
  • Why Is the Government Expanding AI Evaluation Capabilities?

What Features Are Required in the Evaluation Harness?

According to the commercial solutions opening notice published by the Defense Innovation Unit, the War Department is pursuing an evaluation harness that connects to AI models, facilitates evaluation workflows and measures model performance against benchmarks. The harness should support human-in-the-loop, agentic and adversarial evaluations. It should simulate an integrated environment to continuously test and monitor an AI model's performance in challenging settings. Furthermore, the harness should generate evaluation reports and manage benchmark execution.
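
To make the harness description concrete, the following is a minimal Python sketch of the three core duties named in the notice: connecting to models, executing benchmarks and generating a report. Every name here (Model, Task, Benchmark, run_benchmark, evaluate, format_report) is hypothetical and chosen for illustration; none comes from the CSO notice, and exact-match scoring is an assumed stand-in for whatever criteria the government defines.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical adapter type: any callable that takes a prompt and returns a response.
# This is what keeps the sketch vendor-agnostic.
Model = Callable[[str], str]


@dataclass
class Task:
    prompt: str
    expected: str


@dataclass
class Benchmark:
    name: str
    tasks: List[Task]


def run_benchmark(model: Model, benchmark: Benchmark) -> float:
    """Return the fraction of tasks answered with an exact match (assumed scoring rule)."""
    if not benchmark.tasks:
        return 0.0
    correct = sum(1 for t in benchmark.tasks if model(t.prompt).strip() == t.expected)
    return correct / len(benchmark.tasks)


def evaluate(models: Dict[str, Model], benchmarks: List[Benchmark]) -> Dict[str, Dict[str, float]]:
    """Run every benchmark against every model and collect the scores."""
    return {
        model_name: {b.name: run_benchmark(model, b) for b in benchmarks}
        for model_name, model in models.items()
    }


def format_report(report: Dict[str, Dict[str, float]]) -> str:
    """Render scores as tab-separated lines: model, benchmark, score."""
    lines = []
    for model_name, scores in sorted(report.items()):
        for bench_name, score in sorted(scores.items()):
            lines.append(f"{model_name}\t{bench_name}\t{score:.2f}")
    return "\n".join(lines)
```

Because a model is just a callable in this sketch, the same benchmark definitions could be reused across vendors by swapping the adapter. Human-in-the-loop, agentic and adversarial evaluations would need richer task types than the single prompt-and-answer pair shown here.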

What Standards Must the New Benchmarks Meet?

Vendors must provide methodologies for creating benchmarks across unclassified, secret and top secret workflows. The resulting benchmarks must be resistant to gaming, adaptable as requirements and AI models evolve, and supported by training materials. The methodologies should identify the capabilities needed for particular missions, break those capabilities into measurable tasks and create realistic evaluation scenarios. They should also define clear scoring criteria, establish fair performance baselines using open models and ensure benchmarks are valid, reliable and capable of distinguishing different levels of performance.
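
Two of these requirements, a baseline drawn from open models and the ability to distinguish performance levels, can be illustrated with a short Python sketch. The function names, the use of a simple mean as the baseline and the 0.1 spread threshold are all assumptions made for illustration; the notice does not specify how either property should be computed.

```python
from statistics import mean
from typing import Dict


def baseline(open_model_scores: Dict[str, float]) -> float:
    """Assumed baseline: the mean benchmark score across a set of open models."""
    return mean(open_model_scores.values())


def lift_over_baseline(candidate_score: float, open_model_scores: Dict[str, float]) -> float:
    """How far a candidate model scores above (positive) or below (negative) the baseline."""
    return candidate_score - baseline(open_model_scores)


def discriminates(scores: Dict[str, float], min_spread: float = 0.1) -> bool:
    """Crude check that a benchmark separates models.

    If every model lands within min_spread of the others, the benchmark may be
    saturated or too hard to tell performance levels apart. The threshold is
    an illustrative assumption.
    """
    values = list(scores.values())
    return len(values) >= 2 and (max(values) - min(values)) >= min_spread
```

A benchmark on which all models score within a few points of one another would fail the discriminates check under this assumed threshold, which could signal that its tasks need revision.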

Why Is the Government Expanding AI Evaluation Capabilities?

The government is pursuing new evaluation systems to address the rapid advancement of AI technologies. The new infrastructure should be able to evaluate newly released AI models against mission-specific benchmarks. In addition, the system should assess human-machine collaboration to determine whether joint operations yield better mission outcomes than either humans or automated systems alone.

The effort, dubbed “Mystic Depot,” follows calls by Pentagon leadership to accelerate the adoption of AI across warfighting and administrative operations, DefenseScoop reported. Interested vendors can submit their responses to the CSO by March 24.

About ExecutiveGov

ExecutiveGov, published by Executive Mosaic, is a site dedicated to the news and headlines in the federal government. ExecutiveGov serves as a news source for the hot topics and issues facing federal government departments and agencies such as Gov 2.0, cybersecurity policy, health IT, green IT and national security. We also aim to spotlight various federal government employees and interview key government executives whose impact resonates beyond their agency.

Copyright 2026 Executive Mosaic. All Rights Reserved.
