{"id":28133,"date":"2022-12-07T07:14:18","date_gmt":"2022-12-07T12:14:18","guid":{"rendered":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/"},"modified":"2024-04-12T10:21:03","modified_gmt":"2024-04-12T14:21:03","slug":"bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022","status":"publish","type":"post","link":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/","title":{"rendered":"Bloomberg\u2019s AI Engineering Group &#038; CTO Office Publish 5 NLP Research Papers at EMNLP 2022"},"content":{"rendered":"<div class='bbg-row bbg-bg--white  bbg-row--margin-top-none bbg-row--margin-bottom-none' data-anchor='row-6a0ab7124fcee'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p>During the <a href=\"https:\/\/2022.emnlp.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)<\/a> in Abu Dhabi this week, researchers from Bloomberg\u2019s <a href=\"https:\/\/www.TechAtBloomberg.com\/AI\" target=\"_blank\" rel=\"noopener noreferrer\">AI Engineering Group<\/a> and CTO Office are showcasing their expertise in natural language processing (NLP) by publishing five papers. Three papers will appear in Findings of EMNLP 2022. One of these, along with another paper, is also being presented in the virtual poster session during the <a href=\"https:\/\/gem-benchmark.com\/workshop\" target=\"_blank\" rel=\"noopener noreferrer\">Second Version of Generation, Evaluation &amp; Metrics (GEM) Workshop 2022<\/a>. A fifth paper will be presented at the <a href=\"https:\/\/sigtyp.github.io\/ws2022-mrl.html\" target=\"_blank\" rel=\"noopener noreferrer\">2nd Multilingual Representation Learning (MRL) Workshop<\/a>.<\/p>\n<p>Through these papers (links and PDFs of the papers will be added once the <em>Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing<\/em> is available online), the authors and their collaborators highlight a variety of NLP applications, novel approaches and improved models used in key tasks, and other advances to the state-of-the-art in the field of computational linguistics.<\/p>\n<p>We asked some of the authors to summarize their research and explain why the results were notable:<\/p>\n<hr \/>\n<h3 style=\"text-align: center;\"><\/h3>\n<p><strong>Sequentially Controlled Text Generation<br \/>\n<\/strong>Alexander Spangher (USC Viterbi School of Engineering\/Bloomberg Data Science Ph.D. Fellow), Yao Ming\/Xinyu Hua (Bloomberg), Nanyun Peng (UCLA)<\/p>\n<p><em>Findings of EMNLP 2022<br \/>\n<\/em><a href=\"https:\/\/gem-benchmark.com\/workshop\" target=\"_blank\" rel=\"noopener noreferrer\">GEM Workshop<\/a><em>, Virtual Poster Session (Wednesday, December 7, 2022 @ 9:00 PM GST)<\/em><em><br \/>\n<\/em><\/p>\n\n<\/div>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. Why are your results notable?<\/strong><\/p>\n<p><strong>Xinyu:<\/strong> We started off with a basic problem: different consumers of news require varied levels of background context, prefer specific types of article structures depending on what platform they\u2019re reading the news and how much time they have to consume it, and often have different reasons for reading the content. So, given the main subject of a news article, could we fill in relevant contextual information? Could we experiment with different structural modes of storytelling? Could we personalize the news article to fit the different users\u2019 needs?<\/p>\n<p>The unifying principle behind all these questions is: how can we control the macro-structure of the news story we wish to write? We identified this core research question and formulated a novel AI task to address it, something we call \u201csequentially controlled text generation.\u201d In this task, the algorithm is provided with a topic (i.e., in the form of a prompt) and a sequence of structural codes (i.e., \u201cMain Event\u201d -&gt; \u201cBackground\u201d -&gt; \u201cPrevious Event\u201d) that govern \u2013 on a sentence-level \u2013 the macro-structure we wish the generated story to exhibit.<\/p>\n<p><strong>Alex:<\/strong> In our work, we (1) develop baseline methods to solve this task and (2) further study how much structural awareness during story generation contributes to well-structured and fluent text. In our experiments, we use headlines as textual input to the language model and Van Dijk discourse tags (<a href=\"https:\/\/doi.org\/10.4324\/9780203062784\" target=\"_blank\" rel=\"noopener noreferrer\">News as Discourse<\/a>. Routledge. 2013.) as control codes, as illustrated in the Figure below, where \u201cMain Event,\u201d \u201cHistorical Event,\u201d and \u201cExpectation\u201d are the control codes, conditioned on the title \u201cNeo-Nazi murder gang member jailed for life in Germany.\u201d<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure__center image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"768\" height=\"286\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png\" class=\"attachment-medium_large size-medium_large image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 822w\" sizes=\"(max-width: 768px) 100vw, 768px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"822\" height=\"306\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 822w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image4.png 280w\" sizes=\"(max-width: 822px) 100vw, 822px\" \/>\n    <figcaption class='image-figure__caption'><b><i>Figure 1<\/i><\/b><i><span style=\"font-weight: 400\">: An example illustrating the sequential structure of a news article. Our model aims to faithfully generate the sentences conditioned on the headline and discourse tags (Main Event, Historical Event, and Expectation).<\/span><\/i><\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><span style=\"font-weight: 400;\">Our system combines two important approaches in text generation: generation and editing. During generation, we first perturb the output of a language model using a structurally-aware classifier and generate the next word by sampling from the perturbed distribution. Editing is performed at the end of each generated sentence: we identify the most \u201csalient\u201d words, which contribute the most to the discriminator\u2019s prediction on the desired discourse class.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We use heuristics based on part-of-speech to encourage the editor to introduce explicit discourse markers, and we fine-tune a label-aware text infilling model to generate candidate edits given the masked input, which is repeated until there is an increase in likelihood of the desired control code.<\/span><\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure__center image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"768\" height=\"187\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png\" class=\"attachment-medium_large size-medium_large image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 1194w\" sizes=\"(max-width: 768px) 100vw, 768px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1194\" height=\"290\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 1194w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image3.png 280w\" sizes=\"(max-width: 1194px) 100vw, 1194px\" \/>\n    <figcaption class='image-figure__caption'><b><i>Figure 2:<\/i><\/b> Our system\u2019s generation process. We first generate word xi by sampling from a language model which is controlled by a discriminator reflecting the desired discourse tag. Then, during our editing stage, the discriminator helps identify class-salient words, which are then masked and re-generated to boost the likelihood of the desired discourse tag.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><span style=\"font-weight: 400;\">We experiment on the <\/span><a href=\"https:\/\/aclanthology.org\/2020.acl-main.478.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"font-weight: 400;\">NewsDiscourse dataset<\/span><\/a><span style=\"font-weight: 400;\"> and conduct human evaluation against four metrics: Accuracy, Grammar, Logical Flow, and Topicality. Our results show that (1) past structural information boosts class accuracy the most; (2) weak discriminators can still impose accurate control; and (3) editing has an overall positive effect on both generation accuracy and quality.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In text generation, the discourse structure impacts both human and machine comprehension. Although naive language models have made impressive advancements and generate fluent text, the text is structurally dissimilar to text written by humans. We envision this system will enable journalists to quickly prototype different structures for their work, or fill in missing structural components to aid in human-in-the-loop computational journalism.<\/span><\/p>\n<p><b>How does your research advance the state-of-the-art in the field of natural language processing?<\/b><\/p>\n<p><b>Xinyu: <\/b><span style=\"font-weight: 400;\">The controllability issue of neural generation models is the key hurdle for their real world adoption. Existing controllable models focus on single control code (i.e., one signal per document). Our work tackles structured control, which allows for more fine-grained, sequential control based on discourse structures. Our system could vary the degree of control, from local-only to past-aware, and even to full-sequence control. Furthermore, we employ editing as part of our pipeline to continuously improve the output quality. We showcase this system on a news dataset and conduct extensive human evaluation to confirm the improved generation quality and resemblance to human-written text over the original, uncontrolled GPT-based model.<\/span><\/p>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n<div class='bbg-row bbg-bg--white ' data-anchor='row-6a0ab7125ec04'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column'>\n\t<div class='interstitial' data-element='interstitial-component'>\n\t<div class='interstitial-blue_border_design interstitial-bg-white'>\n\t\t<div class='interstitial-blue_border_design__rest bbg-column--width-7'>\n\t\t\t<p class='interstitial-blue_border_design__the_title interstitial_title'>Make it happen here.<\/p>\n\t\t\t<p class='interstitial-blue_border_design__text interstitial_text'><\/p>\n\t\t<\/div>\n\t\t<a class='interstitial-blue_border_design__link interstitial__link bbg-column--width-3 bbg-button bbg-button--size-large' href='https:\/\/bloomberg.avature.net\/en_US\/careers\/SearchJobs?utm_medium=mktg_site&utm_content=company_interstitial&utm_source=website' target='_blank' rel='noopener noreferrer' data-element='interstitial' data-description='' data-label='SEARCH NOW' data-element-position='[@data-element-position]'>SEARCH NOW<\/a>\n\t<\/div>\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n<div class='bbg-row bbg-bg--grey ' data-anchor='row-6a0ab71260263'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p><strong>Realistic Data Augmentation Framework for Enhancing Tabular Reasoning<\/strong><br \/>\nDibyakanti Kumar (IIT Guwahati), Vivek Gupta (University of Utah\/Bloomberg Data Science Ph.D. Fellow), Soumya Sharma (IIT Kharagpur), Shuo Zhang (Bloomberg)<\/p>\n<p><em>Findings of EMNLP 2022<\/em><\/p>\n\n<\/div>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. Why are your results notable?<\/strong><\/p>\n<p><strong>Shuo:<\/strong> The challenge of classifying a hypothesis as entailment, contradiction, or neutral depending on the provided premise is known as Natural Language Inference (NLI). For NLI tasks like semi-structured table reasoning, there are currently two main ways to create training data: via crowdsourcing or through fully automatic techniques. Notably, the former restricts scalability since it is costly and time-consuming, while the latter frequently yields simplistic instances that could lack complex reasoning.<\/p>\n<p>Our work develops a realistic semi-automated framework for data augmentation for tabular inference. Instead of manually generating a hypothesis for each table, our methodology generates hypothesis templates that are transferable to similar tables (see Table 1 for examples). In addition, our framework entails the creation of rational counterfactual tables based on human-written logical constraints and premise paraphrasing. We employ <a href=\"https:\/\/infotabs.github.io\/\" target=\"_blank\" rel=\"noopener noreferrer\">InfoTabs<\/a>, an entity-centric tabular inference dataset, for our study. We found that our methodology could provide examples of tabular inference that resembled those made by humans. As such, this approach could help with training data augmentation, especially in the case of limited supervision.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"1778\" height=\"442\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1778w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 280w\" sizes=\"(max-width: 1778px) 100vw, 1778px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1778\" height=\"442\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1778w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image1.png 280w\" sizes=\"(max-width: 1778px) 100vw, 1778px\" \/>\n    <figcaption class='image-figure__caption'><span style=\"font-weight: 400\"><strong>Table 1.<\/strong> An example of an original and counterfactual table in the &#8220;Person&#8221; category. Here, we illustrate how multiple operations can be used to alter different keys. We also have shown how the labels (E &#8211; Entail, C &#8211; Contradict) for a specific hypothesis can be altered. In the \u201cJanet Leigh\u201d sample table, the first column represents the keys (e.g., Born, Died, etc.), while the second column contains the relevant values (e.g., July 6, 1927; October 3, 2004, etc.).<br \/>\n<\/span><\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>How does your research advance the state-of-the-art in the field of natural language processing?<\/strong><\/p>\n<p>Existing approaches for curating training data are limited in scale or suffer from biases, especially for reasoning on semi-structured tabular data. Our proposed semi-automatic framework can exploit the tabular structure for hypothesis generation, which can then be further transferred to similar tables. Data augmentation would then be another option for curating training data for table reasoning tasks.<\/p>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n<div class='bbg-row bbg-bg--white ' data-anchor='row-6a0ab71265dd2'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p><strong>Weakly Supervised Headline Dependency Parsing<\/strong><br \/>\nAdrian Benton (Bloomberg), Tianze Shi (Cornell University\/Bloomberg Data Science Ph.D. Fellow), Ozan \u0130rsoy\/Igor Malioutov (Bloomberg)<\/p>\n<p><em>Findings of EMNLP 2022<\/em><\/p>\n\n<\/div>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. Why are your results notable?<\/strong><\/p>\n<p><strong>Igor:<\/strong> The unique syntactic properties of English news headlines have been noted in linguistics literature since the 1930s. However, headlines have received surprisingly little attention from the Natural Language Processing (NLP) community in the context of automatic syntactic analysis or parsing. This presents an important limitation regarding headline processing, since correctly analyzing the syntactic structure of text is often critical for accurate semantic understanding, as well as the effectiveness of downstream systems for summarization, information extraction, and question answering.<\/p>\n<p>We bridge this gap by providing the first annotated English news headline corpus of Universal Syntactic dependency parse trees, which enables us to evaluate state-of-the-art dependency parsers on news headlines. To improve the accuracy of English news headline parsing, we develop a method to automatically bootstrap noisy training data from pairs of unlabeled headlines and the lede sentences from the body text.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"730\" height=\"281\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 730w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 280w\" sizes=\"(max-width: 730px) 100vw, 730px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"730\" height=\"281\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 730w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image8.png 280w\" sizes=\"(max-width: 730px) 100vw, 730px\" \/>\n    <figcaption class='image-figure__caption'>Example parses given by EWT (baseline) (Bottom) and Both (finetuned) (Top) on an example headline from the Google Sentence Compression Corpus (GSC). Differing edges are highlighted in green and red for finetuned and baseline, respectively.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p>Our approach is based on a key observation: headlines convey similar semantic content as the news story body \u2013 and they typically share many local substructures. The first sentence in an article, known as a lede sentence, often serves a similar function as the headline. It is meant to grab the reader\u2019s attention and states essential facts about a news event; lede sentences are sometimes direct expansions of the headlines.<\/p>\n<p>Our bootstrapping algorithm projects and carries over the syntactic dependency analysis from the lede sentence to the headline by pruning the lede sentence&#8217;s dependency trees. Because existing models are accurate in parsing long-form news body texts, the resulting silver headline parsed from our bootstrapping algorithm is typically more accurate than applying existing models to headlines directly.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"409\" height=\"184\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 409w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 280w\" sizes=\"(max-width: 409px) 100vw, 409px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"409\" height=\"184\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 409w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image5.png 280w\" sizes=\"(max-width: 409px) 100vw, 409px\" \/>\n    <figcaption class='image-figure__caption'>An example promotion during our projection algorithm. &#8220;release&#8221; is promoted to be the new root of the headline.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>How does your research advance the state-of-the-art in the field of natural language processing?<\/strong><\/p>\n<p>Models trained on our bootstrapped silver headline parses demonstrate significant improvements in performance over models trained solely on gold-annotated long-form texts &#8212; after training with silver parses, we can accurately identify the main predicate\/verb of a sentence up to 98% of the time (up from 75%), and can correctly identify passive construction with up to 91% accuracy (from 11%). Furthermore, we show that these gains translate to downstream improvements in the quality of output extracted by a state-of-the-art open domain information extraction system from headlines.<\/p>\n<p>We hope our data, models, and methodology will encourage further research to improve dependency parsers for overlooked registers of English. In addition, we hope the development of accurate headline dependency parsers will improve the performance of existing headline understanding and processing tasks and enable more subtle linguistic analysis, such as the identification of &#8220;<a href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=1693emnlp 2022\" target=\"_blank\" rel=\"noopener noreferrer\">crash blossoms<\/a>&#8221; (i.e., ambiguous news headlines).<\/p>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n<div class='bbg-row bbg-bg--grey ' data-anchor='row-6a0ab7126fe69'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p><strong>What Makes Data-to-Text Generation Hard for Pretrained Language Models?<\/strong><br \/>\nMoniba Keymanesh, Adrian Benton, Mark Dredze (Bloomberg)<\/p>\n<p><a href=\"https:\/\/gem-benchmark.com\/workshop\" target=\"_blank\" rel=\"noopener noreferrer\">GEM Workshop<\/a>, Virtual Poster Session (Wednesday, December 7, 2022 @ 9:00 PM GST)<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"3456\" height=\"2304\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"Image of the poster for the paper &quot;What Makes Data-to-Text Generation Hard for Pretrained Language Models?&quot; that Moniba Keymanesh, Adrian Benton &amp; Mark Dredze are presenting during the virtual poster session in the 2nd Generation, Evaluation &amp; Metrics (GEM) Workshop at EMNLP 2022 on Wednesday, December 7, 2022.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 3456w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 2048w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 280w\" sizes=\"(max-width: 3456px) 100vw, 3456px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"3456\" height=\"2304\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"Image of the poster for the paper &quot;What Makes Data-to-Text Generation Hard for Pretrained Language Models?&quot; that Moniba Keymanesh, Adrian Benton &amp; Mark Dredze are presenting during the virtual poster session in the 2nd Generation, Evaluation &amp; Metrics (GEM) Workshop at EMNLP 2022 on Wednesday, December 7, 2022.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 3456w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 2048w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/GEM-2022-Poster.jpg 280w\" sizes=\"(max-width: 3456px) 100vw, 3456px\" \/>\n    <figcaption class='image-figure__caption'>The poster for the paper &#8220;What Makes Data-to-Text Generation Hard for Pretrained Language Models?&#8221; that Moniba Keymanesh, Adrian Benton &amp; Mark Dredze are presenting during the virtual poster session in the 2nd Generation, Evaluation &amp; Metrics (GEM) Workshop at EMNLP 2022 on Wednesday, December 7, 2022.<\/figcaption>\n<\/figure>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. Why are your results notable?<\/strong><\/p>\n<p><strong>Moniba:<\/strong> In its role as a leading provider of financial information, Bloomberg maintains a massive database of structured information. Access to this information usually comes in the form of API calls. While users can access this information programmatically, delivering it to users often requires formatting the information as natural language. For example, a Bloomberg journalist may fact-check information about a company or look up an executive\u2019s bio using the Bloomberg Knowledge Graph (BBKG) before incorporating it into a story. This process can even be semi-automated or fully automated, but both processes initially require complex engineering efforts. Moreover, question answering from the Terminal commandline seamlessly provides direct access to structured information, but supporting different question and information types requires significant engineering effort.<\/p>\n<p>The central challenge across these tasks is how to take structured information and express it as fluent and accurate natural language. <strong>The goal of this project is to develop a system that takes as input one or more sets of structured facts or relations (triples) and produces natural language text that expresses this information.<\/strong> This task is closely related to <strong>data-to-text generation<\/strong> in natural language processing literature. Automated data-to-text generation systems take as input a set of relations, where each relation is a (subject, predicate, object) triple(see an example below). Applications of this technology include story or dialogue generation, open-domain question-answering, and text summarization. Domains may span journalism, weather, finance, sports, and even the summarization of patient medical histories.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure__center image-figure--has-small-image\" data-animation=\"\">\n    <a class='image-figure__link' href='https:\/\/aclanthology.org\/2022.findings-acl.36.pdf' target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"1282\" height=\"314\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 1282w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 280w\" sizes=\"(max-width: 1282px) 100vw, 1282px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1282\" height=\"314\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 1282w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image6.png 280w\" sizes=\"(max-width: 1282px) 100vw, 1282px\" \/><\/a>\n    <figcaption class='image-figure__caption'>An input\/output example of data-to-text generation.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p>Previous work shows that pre-trained language models (PLMs) perform remarkably well on this task after supervised learning on a significant amount of data. However, dataset creation for this task can be challenging and not feasible in many domains. On the other hand, while auto-regressive PLMs such as GPT can generalize from a few task examples, their data efficacy at data-to-text is largely unexplored. Furthermore, we have an incomplete understanding of the limits of PLMs on this task. These issues make the path forward for data-to-text generation research unclear.<\/p>\n<p>In this work, we conduct an evaluation of PLMs for data-to-text generation generation, focusing on two classes of challenging examples: examples with novel (unseen) relations (predicates) and examples where the source and target sequences are lexically very different (i.e., not amenable to purely extractive data-to-text generation systems). We consider how GPT-2, adapted with few-shot learning, prompt tuning, and the addition of predicate descriptions, performs on these example classes as compared to a state-of-the-art fine-tuned T5. While GPT-2 performs poorly on DART in the zero-shot setting, we show that its performance can be drastically improved by employing the above techniques.<\/p>\n<p><strong>How does your research advance the state-of-the-art in the field of natural language processing?<\/strong><\/p>\n<p>In this work, we contribute to the data-to-text generation research by benchmarking and analyzing the limitations of two popular PLMs on the multi-domain DART dataset. We also provide recommendations for future model and dataset research in data-to-text generation. Essentially, we make the following contributions:<\/p>\n<ul>\n<li>We evaluate GPT2-XL and fine-tuned T5 for data-to-text generation. While the zero-shot GPT model performs poorly, we evaluate several strategies to improve performance, including few-shot learning and prompt tuning. Both provide significant improvements on the DART dataset.<\/li>\n<\/ul>\n\n<\/div>\n<figure class=\"image-figure image-figure__center image-figure--has-small-image\" data-animation=\"\">\n    <a class='image-figure__link' href='https:\/\/aclanthology.org\/2022.findings-acl.36.pdf' target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"758\" height=\"670\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 758w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 215w\" sizes=\"(max-width: 758px) 100vw, 758px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"758\" height=\"670\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 758w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image2.png 215w\" sizes=\"(max-width: 758px) 100vw, 758px\" \/><\/a>\n    <figcaption class='image-figure__caption'>A customized zero-shot prompt for GPT.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <ul>\n<li>We compare model performance on two classes of difficult examples: examples with unseen predicates and abstractive examples (i.e., examples where source and target sequences are lexically dissimilar). We investigate whether including predicate descriptions in the prompt can improve the ability of PLMs on these classes.<\/li>\n<li>We conduct a human evaluation of PLMs to quantify the prevalence of errors such as hallucination and missing information in generations as a function of the model adaptation technique. We find that a re-ranking strategy for few-shot GPT2-XL, despite having little effect on automatic metrics like BLEU, reduces the incidence of missing information, without requiring additional training data.<\/li>\n<\/ul>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n<div class='bbg-row bbg-bg--white ' data-anchor='row-6a0ab7127d219'>\n  \n\t\n\t\n\t<div class=\"bbg-row--content\">\n\t\t\n\t\t\t<div class='bbg-column bbg-column--width-8 bbg-column--offset-2'>\n\t<div class='bb-wysiwyg'>\n    \n    <p><strong>Entity Retrieval from Multilingual Knowledge Graphs<\/strong><br \/>\nSaher Esmeir (Bloomberg), Arthur C\u00e2mara (Delft University of Technology), Edgar Meij (Bloomberg)<\/p>\n<p><a href=\"https:\/\/sigtyp.github.io\/ws2022-mrl.html\" target=\"_blank\" rel=\"noopener noreferrer\">MRL Workshop<\/a>, Poster Session (Thursday, December 8, 2022 @ 11 AM GST)<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure--has-small-image\" data-animation=\"\">\n    <img loading=\"lazy\" decoding=\"async\" width=\"2000\" height=\"1125\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"Image of the poster for the paper &quot;Entity Retrieval from Multilingual Knowledge Graphs&quot; that Saher Esmeir, Arthur C\u00e2mara &amp; Edgar Meij are presenting during the poster session of the the 2nd Multilingual Representation Learning (MRL) Workshop at EMNLP 2022 on Thursday, December 8, 2022.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 2000w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 280w\" sizes=\"(max-width: 2000px) 100vw, 2000px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"2000\" height=\"1125\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"Image of the poster for the paper &quot;Entity Retrieval from Multilingual Knowledge Graphs&quot; that Saher Esmeir, Arthur C\u00e2mara &amp; Edgar Meij are presenting during the poster session of the the 2nd Multilingual Representation Learning (MRL) Workshop at EMNLP 2022 on Thursday, December 8, 2022.\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 2000w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 1536w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/MRL22-virtaul-poster-FINAL.jpg 280w\" sizes=\"(max-width: 2000px) 100vw, 2000px\" \/>\n    <figcaption class='image-figure__caption'>The poster for the paper &#8220;Entity Retrieval from Multilingual Knowledge Graphs&#8221; that Saher Esmeir, Arthur C\u00e2mara &amp; Edgar Meij are presenting during the poster session of the the 2nd Multilingual Representation Learning (MRL) Workshop at EMNLP 2022 on Thursday, December 8, 2022.<\/figcaption>\n<\/figure>\n<div class=\"bb-separator\" data-color=\"\">\n\t<hr class=\"bb-separator__rule\">\n<\/div>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>Please summarize your research. Why are your results notable?<\/strong><\/p>\n<p><strong>Saher:<\/strong> Given a knowledge graph (KG) and a user query, the task of entity retrieval aims to retrieve a ranked list of entities ordered by relevance to the query. In this work, we define and address the multilingual entity retrieval task in which the user queries, as well as the entities in the KG, may be represented in multiple, possibly distinct, languages.<\/p>\n<p>Due to different data sources and points of view, information in different languages may be similar, complementary, or conflicting. To benefit from this diversity, we propose to leverage multilingual language models. In the training stage, we fine-tune the language models on data from multiple languages. In the retrieval stage, we use machine translation to obtain results for different versions of the same query and then blend the scores to produce our final ranking.<\/p>\n\n<\/div>\n<figure class=\"image-figure image-figure__center image-figure--has-small-image\" data-animation=\"\">\n    <a class='image-figure__link' href='https:\/\/aclanthology.org\/2022.findings-acl.36.pdf' target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"1133\" height=\"929\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png\" class=\"attachment-full size-full image-figure__image image-figure__image--primary\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 1133w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 232w\" sizes=\"(max-width: 1133px) 100vw, 1133px\" \/><img loading=\"lazy\" decoding=\"async\" width=\"1133\" height=\"929\" src=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png\" class=\"attachment-full size-full image-figure__image image-figure__image--small\" alt=\"\" srcset=\"https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 1133w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 1024w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 768w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&amp;type=webp&amp;url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/image7.png 232w\" sizes=\"(max-width: 1133px) 100vw, 1133px\" \/><\/a>\n    <figcaption class='image-figure__caption'>A sample multilingual subgraph. The information in different languages may be similar, complementary, or conflicting. For example, the information regarding the origin of Falafel is different between Arabic and Hebrew, and unavailable in English.<\/figcaption>\n<\/figure>\n<div class='bb-wysiwyg'>\n    \n    <p><strong>How does your research advance the state-of-the-art in the field of natural language processing?<\/strong><\/p>\n<p>The performance of our system on the standard test collection, <a href=\"https:\/\/github.com\/iai-group\/DBpedia-Entity\" target=\"_blank\" rel=\"noopener noreferrer\">DBpedia Entity v2<\/a>, where both the knowledge graph and the query are in English, is comparable to that of state-of-the-art methods. However, thanks to its simplicity and flexibility, our method can be extended to virtually any language that is represented in the searched graph, producing strong baseline results for many languages.<\/p>\n<p>Furthermore, we show that, even for highly-resourced languages such as English, taking information from other languages can significantly improve the retrieval performance when relevant coverage in multiple languages is available.<\/p>\n<p>Finally, we provide a resource for multilingual entity retrieval by extending the English-only DBpedia Entity v2. The extended version provides judged relevance scores for query-entity pairs in any language, provided machine translation is supported and the entity exists in the DBpedia edition of that language.<\/p>\n\n<\/div>\n\n<\/div>\n\n\n\t\t\n\t<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Bloomberg&#8217;s five EMNLP 2022 research papers highlight a variety of state-of-the-art applications, novel approaches, and improved models used in key NLP tasks<\/p>\n","protected":false},"author":184,"featured_media":28187,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1466],"tags":[1498,1578,1637,1572,1472,1485,1624,1486,1638,1580,1591],"class_list":["post-28133","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech-at-bloomberg","tag-ai","tag-artificial-intelligence","tag-computational-linguistics","tag-cto","tag-data-science","tag-machine-learning","tag-ml","tag-natural-language-processing","tag-neural-ranking","tag-nlp","tag-sentiment"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v19.11 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Bloomberg\u2019s AI Engineering Group &amp; CTO Office Publish 5 NLP Research Papers at EMNLP 2022<\/title>\n<meta name=\"description\" content=\"Bloomberg&#039;s five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Bloomberg\u2019s AI Engineering Group &amp; CTO Office Publish 5 NLP Research Papers at EMNLP 2022\" \/>\n<meta property=\"og:description\" content=\"Bloomberg&#039;s five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/\" \/>\n<meta property=\"og:site_name\" content=\"Bloomberg L.P.\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/bloomberglp\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-12-07T12:14:18+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-12T14:21:03+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"617\" \/>\n\t<meta property=\"og:image:height\" content=\"403\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"chaas30\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg\" \/>\n<meta name=\"twitter:creator\" content=\"@bloomberg\" \/>\n<meta name=\"twitter:site\" content=\"@bloomberg\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"chaas30\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"17 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/\",\"url\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/\",\"name\":\"Bloomberg\u2019s AI Engineering Group & CTO Office Publish 5 NLP Research Papers at EMNLP 2022\",\"isPartOf\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/#website\"},\"datePublished\":\"2022-12-07T12:14:18+00:00\",\"dateModified\":\"2024-04-12T14:21:03+00:00\",\"author\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e\"},\"description\":\"Bloomberg's five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":\"1\",\"name\":\"Home\",\"item\":\"https:\/\/www.bloomberg.com\/company\/\"},{\"@type\":\"ListItem\",\"position\":\"2\",\"name\":\"Bloomberg\u2019s AI Engineering Group &#038; CTO Office Publish 5 NLP Research Papers at EMNLP 2022\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/#website\",\"url\":\"https:\/\/www.bloomberg.com\/company\/\",\"name\":\"Bloomberg L.P.\",\"description\":\"Bloomberg L.P. is the leader in global business and financial information, enabling customers to make smarter, faster, more informed business decisions.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.bloomberg.com\/company\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e\",\"name\":\"Bloomberg L.P.\",\"url\":\"https:\/\/www.bloomberg.com\/company\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Bloomberg\u2019s AI Engineering Group & CTO Office Publish 5 NLP Research Papers at EMNLP 2022","description":"Bloomberg's five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/","og_locale":"en_US","og_type":"article","og_title":"Bloomberg\u2019s AI Engineering Group & CTO Office Publish 5 NLP Research Papers at EMNLP 2022","og_description":"Bloomberg's five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.","og_url":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/","og_site_name":"Bloomberg L.P.","article_publisher":"https:\/\/www.facebook.com\/bloomberglp\/","article_published_time":"2022-12-07T12:14:18+00:00","article_modified_time":"2024-04-12T14:21:03+00:00","og_image":[{"width":617,"height":403,"url":"https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg","type":"image\/jpeg"}],"author":"chaas30","twitter_card":"summary_large_image","twitter_image":"https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg","twitter_creator":"@bloomberg","twitter_site":"@bloomberg","twitter_misc":{"Written by":"chaas30","Est. reading time":"17 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/","url":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/","name":"Bloomberg\u2019s AI Engineering Group & CTO Office Publish 5 NLP Research Papers at EMNLP 2022","isPartOf":{"@id":"https:\/\/www.bloomberg.com\/company\/#website"},"datePublished":"2022-12-07T12:14:18+00:00","dateModified":"2024-04-12T14:21:03+00:00","author":{"@id":"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e"},"description":"Bloomberg's five EMNLP 2022 research papers highlight a variety of applications, novel approaches, and improved models used in key NLP tasks.","breadcrumb":{"@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.bloomberg.com\/company\/stories\/bloombergs-ai-engineering-group-cto-publish-5-nlp-research-papers-at-emnlp-2022\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":"1","name":"Home","item":"https:\/\/www.bloomberg.com\/company\/"},{"@type":"ListItem","position":"2","name":"Bloomberg\u2019s AI Engineering Group &#038; CTO Office Publish 5 NLP Research Papers at EMNLP 2022"}]},{"@type":"WebSite","@id":"https:\/\/www.bloomberg.com\/company\/#website","url":"https:\/\/www.bloomberg.com\/company\/","name":"Bloomberg L.P.","description":"Bloomberg L.P. is the leader in global business and financial information, enabling customers to make smarter, faster, more informed business decisions.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.bloomberg.com\/company\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.bloomberg.com\/company\/#\/schema\/person\/4d4a18aae79d6fcc1ea98181a906905e","name":"Bloomberg L.P.","url":"https:\/\/www.bloomberg.com\/company"}]}},"featured_image_rendered":"<img srcset='https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg 280w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg 300w, https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg 617w' src='https:\/\/assets.bbhub.io\/image\/v1\/resize?width=auto&type=webp&url=https:\/\/assets.bbhub.io\/company\/sites\/51\/2022\/12\/EMNLP.jpg' alt='' \/>","category_info":{"name":"Tech At Bloomberg","blog_landing_name":"Tech At Bloomberg"},"_links":{"self":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/28133","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/users\/184"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/comments?post=28133"}],"version-history":[{"count":10,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/28133\/revisions"}],"predecessor-version":[{"id":35539,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/posts\/28133\/revisions\/35539"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/media\/28187"}],"wp:attachment":[{"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/media?parent=28133"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/categories?post=28133"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bloomberg.com\/company\/wp-json\/wp\/v2\/tags?post=28133"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}