PACT, head-to-head LLM negotiation benchmark. 20-round buyer-seller bargaining game: each round the AIs can message, the buyer submits a bid and the seller submits an ask. If bid ≥ ask, trade clears at the midpoint. Thousands of matchups.

r/singularity
Generative AI Open Source AI AI Research

PACT tests negotiation under partial information: persuasion, commitment, deception, anchoring, threats, and adaptation across repeated rounds. info, game logs, charts: GPT-5.5, Opus 4.7, DeepSeek V4 Pro, Gemini 3.1 Pro, Kimi K2.6 are the top 5. Note that opponent mixes vary by model and charts like Average Profit by Round do not control for them. submitted by /u/zero0_one1 [link] [comments]