{"id":43157,"date":"2025-07-27T15:55:31","date_gmt":"2025-07-27T19:55:31","guid":{"rendered":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/"},"modified":"2025-07-27T15:55:31","modified_gmt":"2025-07-27T19:55:31","slug":"bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025","status":"publish","type":"post","link":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/","title":{"rendered":"Bloomberg&#8217;s AI engineers introduce an improved agent tool-calling methodology at ACL 2025"},"content":{"rendered":"<div class='bbg-row bbg-bg--white  bbg-row--margin-top-none bbg-row--margin-bottom-none' data-anchor='row-6a092c866f172'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p>During the <a href=\"https:\/\/2025.aclweb.org\/\" target=\"_blank\" rel=\"noopener\">63rd Annual Meeting of the Association for Computational Linguistics<\/a> (ACL 2025) this week in Vienna, Austria, researchers from Bloomberg\u2019s AI Engineering group in London are showcasing their expertise in large language models (LLMs) and tool-based agentic AI with their paper \u201c<a href=\"https:\/\/aclanthology.org\/2025.findings-acl.1149\/\" target=\"_blank\" rel=\"noopener\">A Joint Optimization Framework for Enhancing Efficiency of Tool Utilization in LLM Agents<\/a>.\u201d<\/p>\n<p>In the paper, which is published in &#8220;<a href=\"https:\/\/aclanthology.org\/volumes\/2025.findings-acl\/\" target=\"_blank\" rel=\"noopener\">Findings of the Association for Computational Linguistics: ACL 2025<\/a>,&#8221; Bin Wu, a <a href=\"https:\/\/www.bloomberg.com\/company\/stories\/introducing-the-sixth-cohort-of-bloomberg-data-science-ph-d-fellows-2023-2024\/\" target=\"_blank\" 
rel=\"noopener\">Bloomberg Data Science Ph.D. Fellow<\/a> and Ph.D. student at University College London, Edgar Meij, Head of AI Platforms in Bloomberg\u2019s AI Engineering group, and <a href=\"https:\/\/sites.google.com\/site\/emineyilmaz\/\" target=\"_blank\" rel=\"noopener\">Emine Yilmaz<\/a>, a professor and EPSRC Fellow at University College London\u2019s Department of Computer Science \u2013 where she also leads the Web Intelligence Group at the UCL Centre for Artificial Intelligence \u2013 demonstrate the crucial role of the instructions provided in agent prompts and tool descriptions \u2013 collectively referred to as context. Incomplete or suboptimal context in the instructions and tool descriptions significantly increases the number of tool calls that LLMs need to make to generate an adequate response, leading to computational overhead. They propose a new methodology for automatically improving agent prompts and tool descriptions, and demonstrate that it substantially reduces the number of tool calls the LLM agent needs to make.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"720\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 300w, 
https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 280w\" sizes=\"(max-width: 1280px) 100vw, 1280px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"720\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 280w\" sizes=\"(max-width: 1280px) 100vw, 1280px\" \/>\n    \n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p>In addition, two members of Bloomberg\u2019s AI Strategy &amp; Research team in the company\u2019s CTO Office \u2013 Sebastian Gehrmann, Head of 
Responsible AI, and Enrico Santus, Principal Technical Strategist for Human-AI Interaction and Academic Engagement \u2013 are among the organizers of the fourth iteration of the <a href=\"https:\/\/gem-benchmark.com\/workshop\" target=\"_blank\" rel=\"noopener\">Generation, Evaluation &amp; Metrics Workshop (GEM2)<\/a>, which will be held as part of ACL on July 31, 2025. In light of the broad accessibility of LLMs, this workshop will serve as a forum for researchers and practitioners from both the natural language processing and machine learning communities to come together to explore potential approaches and research directions to address the broader types of natural language generation (NLG) challenges \u2013 in particular, the evaluation of model-generated outputs. While these advanced models can generate fluent text, ensuring the usefulness, quality, and fairness of their output is essential to help bridge the gap between research and real-world applications.<\/p>\n<p>We asked the paper\u2019s lead author and one of the workshop organizers to explain why their work is notable in advancing the state of the art with regard to LLMs and agentic AI:<\/p>\n<hr \/>\n<h3 style=\"text-align: center;\"><strong><u>Wednesday, July 30, 2025<\/u><\/strong><\/h3>\n<p><em>Session 12: IP-Posters (Findings Posters &#8211; In-Person 4)<\/em><br \/>\n<em>11:00-12:30 CEST<\/em><\/p>\n<p><a href=\"https:\/\/aclanthology.org\/2025.findings-acl.1149\/\" target=\"_blank\" rel=\"noopener\"><strong>A Joint Optimization Framework for Enhancing Efficiency of Tool Utilization in LLM Agents<\/strong><\/a><br \/>\nBin Wu (Centre for Artificial Intelligence, University College London), Edgar Meij (Bloomberg), Emine Yilmaz (Centre for Artificial Intelligence, University College London)<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <a class='image-figure__link' href='https:\/\/aclanthology.org\/2025.findings-acl.1149.pdf' 
target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"1654\" height=\"2339\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"Click to read &quot;A Joint Optimization Framework for Enhancing Efficiency of Tool Utilization in LLM Agents,&quot; published in &quot;Findings of the Association for Computational Linguistics&quot; at ACL 2025.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1654w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 212w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 724w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1086w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1448w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 134w\" sizes=\"(max-width: 1654px) 100vw, 1654px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1654\" height=\"2339\" 
src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"Click to read &quot;A Joint Optimization Framework for Enhancing Efficiency of Tool Utilization in LLM Agents,&quot; published in &quot;Findings of the Association for Computational Linguistics&quot; at ACL 2025.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1654w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 212w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 724w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1086w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 1448w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/2025.findings-acl.1149_Page_01.png 134w\" sizes=\"(max-width: 1654px) 100vw, 1654px\" \/><\/a>\n    \n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. 
Why are your results notable?<\/strong><\/p>\n<p><strong>Bin Wu:<\/strong> This research proposes a joint optimization framework that aims to improve the efficiency of tool-augmented LLM agents by systematically refining both agent instructions and tool descriptions. Traditional approaches have either focused on enhancing tool use effectiveness through reasoning strategies \u2013 like chain-of-thought (CoT) or tree-of-thoughts (ToT) prompting \u2013 or optimized only a single aspect (i.e., either the instructions or the tool documentation). However, these prior methods incur high computational costs and often overlook efficiency, particularly under conditions where context is incomplete.<\/p>\n<p>Our proposed framework introduces a three-stage process:<\/p>\n<ul>\n<li><strong>Feedback Generator:<\/strong> Evaluates the effectiveness and efficiency of tool calls.<\/li>\n<li><strong>Suggestion Coordinator:<\/strong> Produces separate improvement suggestions for agent prompts and tool descriptions.<\/li>\n<li><strong>Context Refiner:<\/strong> Processes these suggestions to update the context in a stable and scalable way.<\/li>\n<\/ul>\n<p>Notable results include the following:<\/p>\n<ol>\n<li>We show that incomplete context requires LLMs to make more tool calls to generate their response.<\/li>\n<li>We confirm up to a 70% reduction in required tool calls on StableToolBench and 47% fewer redundant calls on RestBench while maintaining or improving pass rates.<\/li>\n<\/ol>\n<p><strong>Why is it important to optimize context to improve the efficiency of agentic tool calling?<\/strong><\/p>\n<p>In practice, incomplete context is very common. This occurs because agent instructions are typically designed manually through extensive trial and error. Tool descriptions are also written by humans, and complex tools are especially difficult to describe completely. Our empirical analysis shows that incomplete context is one of the causes of computational overhead. 
Thus, when end-to-end agentic LLMs use tools, optimizing context is essential to help improve their efficiency.<\/p>\n<p><strong>How does your research advance the state-of-the-art in the field of agentic\/generative AI?<\/strong><\/p>\n<p>This work advances the field in the following key ways:<\/p>\n<ul>\n<li><strong>Joint optimization of context:<\/strong> Most prior research improved either the <em>agent prompt<\/em> or the <em>tool descriptions<\/em>, but not both simultaneously. This study is the first to propose a <em>joint, automated optimization<\/em> of both, acknowledging their interaction and combined effect on agent performance.<\/li>\n<li><strong>Verbalized optimization pipeline:<\/strong> Instead of relying on resource-intensive model fine-tuning, the authors introduce a <em>training-free, text-based optimization<\/em> framework. It uses the LLMs themselves to produce feedback and improvements \u2013 making it scalable and applicable to <em>closed-source or resource-constrained environments<\/em>.<\/li>\n<li><strong>New evaluation metric \u2013 CAPR:<\/strong> The introduction of <em>Cost-Aware Pass Rate (CAPR)<\/em> is a significant contribution. Unlike traditional metrics focused solely on effectiveness, CAPR incorporates computational cost, thereby aligning better with real-world requirements for efficient and cost-effective AI agents.<\/li>\n<\/ul>\n<p><strong>Were there any surprising or unexpected outcomes from your research?<\/strong><\/p>\n<p>Yes, several findings were unexpected and noteworthy:<\/p>\n<ul>\n<li><strong>Incomplete context hampers efficiency more than effectiveness.<\/strong> While incomplete context degrades performance as expected, our experiments revealed that it particularly worsens efficiency \u2013 not just effectiveness. 
Agents still solved tasks, but did so needing far more tool calls, highlighting a hidden cost that was overlooked in prior research.<\/li>\n<li><strong>Tool descriptions also play a large role.<\/strong> Contrary to common assumptions that agent instructions are the dominant factor, jointly-optimized tool descriptions yield far greater efficiency gains than instruction improvements alone.<\/li>\n<li><strong>Verbalized optimization can overfit.<\/strong> Iterative context refinement sometimes led to overfitting, where additional iterations increased the required tool calls and degraded performance. This mirrors overfitting in traditional machine learning and suggests the need for regularization techniques in verbalized optimization.<\/li>\n<\/ul>\n<p><em>Read more about Bloomberg\u2019s agentic AI infrastructure <a href=\"https:\/\/www.bloomberg.com\/company\/stories\/closing-the-agentic-ai-productionization-gap-bloomberg-embraces-mcp\/\" target=\"_blank\" rel=\"noopener\">here<\/a>.<\/em><\/p>\n\n<\/div>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <h3 style=\"text-align: center;\"><strong><u>Thursday, July 31, 2025<\/u><\/strong><\/h3>\n<p><strong><a href=\"https:\/\/gem-benchmark.com\/workshop\" target=\"_blank\" rel=\"noopener\">GEM2 Workshop: Generation, Evaluation &amp; Metrics<\/a><br \/>\n<\/strong>Sebastian Gehrmann (Bloomberg), Gabriel Stanovsky (Hebrew University of Jerusalem), Simon Mille (Dublin City University), Enrico Santus (Bloomberg), Miruna Clinciu (Heriot Watt University), Kaustubh Dhole (Emory University), Yotam Perlitz (IBM Research), Rotem Dror (University of Haifa), Itay Itzhak (Hebrew University of Jerusalem), Ofir Arviv (IBM Research), Eliya Habba (Hebrew University of Jerusalem), Michal Shmueli Scheuer (IBM Research), Jo\u00e3o Sedoc (New York University) and Oyvind Tafjord (Allen Institute for Artificial Intelligence)<\/p>\n\n<\/div>\n<div 
class='bb-wysiwyg'>\n    \n    <p><strong>Please explain the goal of this workshop. Why are you helping to organize it?<\/strong><\/p>\n<p><strong>Enrico Santus:<\/strong> This is the fourth edition of the Generation, Evaluation &amp; Metrics Workshop (GEM). My colleague, Sebastian Gehrmann, originally started it in 2020, when evaluation of generated text first started becoming incredibly important. Now that GenAI is ubiquitous, GEM has grown into one of the largest workshops held at any NLP conference, and we couldn\u2019t be more excited to help lead it together with the outstanding organizing team.<\/p>\n<p>As GenAI is increasingly used for high-impact applications \u2013 from healthcare to robotics and finance \u2013 the stakes for evaluation have never been higher. Yet, many of today\u2019s benchmarks are brittle, hard to reproduce, or fail to reflect real-world complexity. That\u2019s why we believe GEM2 will help shift the field toward more meaningful, efficient, and robust evaluation practices.<\/p>\n<p>This year, more than 86 scientific publications will be presented at GEM2, alongside three keynotes and a panel. Moreover, for the second year, we have also built a space where industry and academia can meet each other through a dedicated Industrial Track. That conversation will be catalyzed in a panel with leading voices from DeepMind, Contextual AI, and aiXplain, during which the speakers will share what it means to evaluate generative models in real-world production environments.<\/p>\n<p><strong>How do you expect or hope that this workshop will help advance the state-of-the-art in terms of the evaluation of LLMs?<\/strong><\/p>\n<p>We hope GEM2 helps change how our community thinks about evaluation. Right now, much of the focus in LLM benchmarking is on leaderboards, but they don\u2019t tell the full story. Models are sensitive to prompting, few-shot formatting, and even punctuation. 
Reproducibility is a challenge, and many current metrics don\u2019t reflect how models behave under pressure or in production. GEM2 encourages the field to go deeper, to explore robustness, fairness, instruction-following variance, and real-world generalization.<\/p>\n<p>We\u2019re incredibly fortunate to have three invited speakers who each bring powerful perspectives:<\/p>\n<ul>\n<li>Barbara Plank (LMU Munich) will present new work on ambiguity, inconsistency, and flawed reasoning in LLMs.<\/li>\n<li>Leshem Choshen (MIT-IBM) will dive into underexplored frontiers, like pretraining evaluation, tinyBenchmarks, multicultural benchmarking, and the risks of data contamination.<\/li>\n<li>Ehud Reiter (University of Aberdeen) will challenge us to go beyond metrics and focus on real-world impact.<\/li>\n<\/ul>\n<p>Most importantly, GEM2 is about community. Over the past four years, the GEM community has grown into a vibrant global network, bringing together hundreds of contributors from across continents, disciplines, and institutions. 
Through their work, the GEM community is shaping the future of NLP evaluation, and we are excited to be among its hosts.<\/p>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>At ACL 2025, a paper by Bloomberg&#8217;s AI engineers aims to improve the efficiency of agentic tool calling, while a workshop they&#8217;ve helped organize seeks to make LLM evaluation more meaningful, efficient, and robust<\/p>\n","protected":false},"author":184,"featured_media":43160,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1466],"tags":[2328,2329,2330,2331,1498,1578,1472,2332,2237,2236,2146,2325,2144,2333,2145,1477],"class_list":["post-43157","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech-at-bloomberg","tag-acl","tag-acl-2025","tag-agentic-ai","tag-agentic-tool-calling","tag-ai","tag-artificial-intelligence","tag-data-science","tag-evaluation","tag-gen-ai","tag-genai","tag-generative-ai","tag-information-retrieval","tag-large-language-models","tag-llm","tag-llms","tag-research"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v19.11 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Bloomberg&#039;s AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP<\/title>\n<meta name=\"description\" content=\"At ACL 2025, Bloomberg&#039;s AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful &amp; robust\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" 
content=\"article\" \/>\n<meta property=\"og:title\" content=\"Bloomberg&#039;s AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP\" \/>\n<meta property=\"og:description\" content=\"At ACL 2025, Bloomberg&#039;s AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful &amp; robust\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/\" \/>\n<meta property=\"og:site_name\" content=\"Bloomberg L.P.\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/bloomberglp\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-07-27T19:55:31+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1280\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"chaas30\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png\" \/>\n<meta name=\"twitter:creator\" content=\"@bloomberg\" \/>\n<meta name=\"twitter:site\" content=\"@bloomberg\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"chaas30\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/\",\"url\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/\",\"name\":\"Bloomberg's AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP\",\"isPartOf\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/#website\"},\"datePublished\":\"2025-07-27T19:55:31+00:00\",\"dateModified\":\"2025-07-27T19:55:31+00:00\",\"author\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e\"},\"description\":\"At ACL 2025, Bloomberg's AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful & robust\",\"breadcrumb\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":\"1\",\"name\":\"Home\",\"item\":\"https:\/\/www.bloomberg.com\/company\/\"},{\"@type\":\"ListItem\",\"position\":\"2\",\"name\":\"Bloomberg&#8217;s AI engineers introduce an improved agent tool-calling methodology at ACL 
2025\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/#website\",\"url\":\"https:\/\/www.bloomberg.com\/company\/\",\"name\":\"Bloomberg L.P.\",\"description\":\"Bloomberg L.P. is the leader in global business and financial information, enabling customers to make smarter, faster, more informed business decisions.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.bloomberg.com\/company\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e\",\"name\":\"Bloomberg L.P.\",\"url\":\"https:\/\/www.bloomberg.com\/company\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Bloomberg's AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP","description":"At ACL 2025, Bloomberg's AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful & robust","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/","og_locale":"en_US","og_type":"article","og_title":"Bloomberg's AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP","og_description":"At ACL 2025, Bloomberg's AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful & robust","og_url":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/","og_site_name":"Bloomberg 
L.P.","article_publisher":"https:\/\/www.facebook.com\/bloomberglp\/","article_published_time":"2025-07-27T19:55:31+00:00","og_image":[{"width":1280,"height":720,"url":"https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png","type":"image\/png"}],"author":"chaas30","twitter_card":"summary_large_image","twitter_image":"https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png","twitter_creator":"@bloomberg","twitter_site":"@bloomberg","twitter_misc":{"Written by":"chaas30","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/","url":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/","name":"Bloomberg's AI engineers introduce an improved agent tool-calling methodology at ACL 2025 | Bloomberg LP","isPartOf":{"@id":"https:\/\/www.bloomberg.com\/company\/#website"},"datePublished":"2025-07-27T19:55:31+00:00","dateModified":"2025-07-27T19:55:31+00:00","author":{"@id":"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e"},"description":"At ACL 2025, Bloomberg's AI engineers seek to improve the efficiency of agentic tool calling and make LLM evaluation more meaningful & 
robust","breadcrumb":{"@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineers-introduce-an-improved-agent-tool-calling-methodology-acl-2025\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":"1","name":"Home","item":"https:\/\/www.bloomberg.com\/company\/"},{"@type":"ListItem","position":"2","name":"Bloomberg&#8217;s AI engineers introduce an improved agent tool-calling methodology at ACL 2025"}]},{"@type":"WebSite","@id":"https:\/\/www.bloomberg.com\/company\/#website","url":"https:\/\/www.bloomberg.com\/company\/","name":"Bloomberg L.P.","description":"Bloomberg L.P. 
is the leader in global business and financial information, enabling customers to make smarter, faster, more informed business decisions.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.bloomberg.com\/company\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e","name":"Bloomberg L.P.","url":"https:\/\/www.bloomberg.com\/company"}]}},"featured_image_rendered":"<img srcset='https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png 1280w' src='https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2025\/07\/ACL-2025-Paper_Workshop.png' alt='' \/>","category_info":{"name":"Tech At Bloomberg","blog_landing_name":"Tech At 
Bloomberg"},"_links":{"self":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/43157","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/users\/184"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/comments?post=43157"}],"version-history":[{"count":4,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/43157\/revisions"}],"predecessor-version":[{"id":43163,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/43157\/revisions\/43163"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/media\/43160"}],"wp:attachment":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/media?parent=43157"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/categories?post=43157"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/tags?post=43157"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}