Man Technology Event - ML Meetup: Modeling Reward and Abductive Learning

Man Technology Team

Man Technology

At Man Group, we believe in the Python Ecosystem and have been trading Machine Learning based systems since early 2014.

August 2019

To give back and strengthen London’s Python and Machine Learning Communities, we sponsor and support the PyData and Machine Learning London Meetups.

In August, we had the pleasure of welcoming Edward Grefenstette, research scientist at Facebook AI Research, and Wang-Zhou Dai, research associate in the Department of Computing at Imperial College London, to the London Machine Learning Meetup.

Teaching Artificial Agents to Understand Language by Modelling Reward - Edward Grefenstette

Recent progress in Deep Reinforcement Learning has shown that agents can be taught complex behaviour and solve difficult tasks, such as playing video games from pixel observations, or mastering the game of Go without observing human games, with relatively little prior information. Building on these successes, researchers such as Hermann and colleagues have sought to apply these methods to teach-in simulation-agents to complete a variety of tasks specified by combinatorially rich instruction languages. In this talk, we discuss some of these highlights and some of the limitations which inhibit scalability of such approaches to more complex instruction languages (including natural language). Following this, we introduce a new approach, inspired by recent work in adversarial reward modelling, which constitutes a first step towards scaling instruction-conditional agent training to “real world” language.

Edward Grefenstette

Edward Grefenstette is a Research Scientist at Facebook AI Research, and Honorary Associate Professor at UCL. He previously was, in reverse order, a Staff Research Scientist at DeepMind, the CTO of Dark Blue Labs, and a Junior Research Fellow within Oxford’s Department of Computer Science and Somerville College. His recent research has covered topics at the intersection of deep learning and machine reasoning, addressing questions such as how neural networks can model or understand logic and mathematics, infer implicit or human-readable programs, or learn to understand instructions from simulation.

Bridging Machine Learning and Logical Reasoning by Abductive Learning - Wang-Zhou Dai

Perception and reasoning are two representative abilities of intelligence that are integrated seamlessly during problem-solving processes. In the area of artificial intelligence (AI), perception is usually realised by machine learning and reasoning is often formalised by logic programming. However, the two categories of techniques were developed separately throughout most of the history of AI. This talk will introduce the abductive learning framework targeted at unifying the two AI paradigms in a mutually beneficial way. In this framework, machine learning models learn to perceive primitive logical facts from the raw data, while logical reasoning is able to correct the wrongly perceived facts for improving the machine learning models. We demonstrate that by using the abductive learning framework, computers can learn to recognise numbers and resolve equations with unknown arithmetic operations simultaneously from images of simple hand-written equations. Moreover, the learned models can be generalized to complex equations and adapted to different tasks, which is beyond the capability of state-of-the-art deep learning models.

Wang-Zhou Dai

Wang-Zhou Dai is a research associate in the Department of Computing, Imperial College London. His research interests lie in the area of Artificial Intelligence and machine learning, especially in applying first-order logical background knowledge in general machine learning techniques. He has published multiple research papers on major conferences and journals in AI and machine learning including AAAI, ILP, ICDM, ACML and Machine Learning, etc. He has been awarded the IBM PhD Fellowship and Google Excellence Scholarship during his PhD study, and now he is serving as a PC member and reviewer in many top AI & machine learning conferences.

I am interested in other Tech Articles.

To receive e-mail alerts whenever new Tech Articles or Events are posted on this site, please subscribe below.

Find out more about Technology at Man Group

Important information

In the case of hypothetical results:

Hypothetical Results are calculated in hindsight, invariably show positive rates of return, and are subject to various modeling assumptions, statistical variances and interpretational differences. No representation is made as to the reasonableness or accuracy of the calculations or assumptions made or that all assumptions used in achieving the results have been utilized equally or appropriately, or that other assumptions should not have been used or would have been more accurate or representative. Changes in the assumptions would have a material impact on the Hypothetical Results and other statistical information based on the Hypothetical Results.

The Hypothetical Results have other inherent limitations, some of which are described below. They do not involve financial risk or reflect actual trading by an Investment Product, and therefore do not reflect the impact that economic and market factors, including concentration, lack of liquidity or market disruptions, regulatory (including tax) and other conditions then in existence may have on investment decisions for an Investment Product. In addition, the ability to withstand losses or to adhere to a particular trading program in spite of trading losses are material points which can also adversely affect actual trading results. Since trades have not actually been executed, Hypothetical Results may have under or over compensated for the impact, if any, of certain market factors. There are frequently sharp differences between the Hypothetical Results and the actual results of an Investment Product. No assurance can be given that market, economic or other factors may not cause the Investment Manager to make modifications to the strategies over time. There also may be a material difference between the amount of an Investment Product’s assets at any time and the amount of the assets assumed in the Hypothetical Results, which difference may have an impact on the management of an Investment Product. Hypothetical Results should not be relied on, and the results presented in no way reflect skill of the investment manager. A decision to invest in an Investment Product should not be based on the Hypothetical Results.

No representation is made that an Investment Product’s performance would have been the same as the Hypothetical Results had an Investment Product been in existence during such time or that such investment strategy will be maintained substantially the same in the future; the Investment Manager may choose to implement changes to the strategies, make different investments or have an Investment Product invest in other investments not reflected in the Hypothetical Results or vice versa. To the extent there are any material differences between the Investment Manager’s management of an Investment Product and the investment strategy as reflected in the Hypothetical Results, the Hypothetical Results will no longer be as representative and their illustration value will decrease substantially. No representation is made that an Investment Product will or is likely to achieve its objectives or results comparable to those shown, including the Hypothetical Results, or will make any profit or will be able to avoid incurring substantial losses. Past performance is not indicative of future results and simulated results in no way reflect upon the manger’s skill or ability.

This information is communicated and/or distributed by the relevant Man entity identified below (collectively the "Company") subject to the following conditions and restriction in their respective jurisdictions.

Opinions expressed are those of the author and may not be shared by all personnel of Man Group plc (‘Man’). These opinions are subject to change without notice, are for information purposes only and do not constitute an offer or invitation to make an investment in any financial instrument or in any product to which the Company and/or its affiliates provides investment advisory or any other financial services. Any organisations, financial instrument or products described in this material are mentioned for reference purposes only which should not be considered a recommendation for their purchase or sale. Neither the Company nor the authors shall be liable to any person for any action taken on the basis of the information provided. Some statements contained in this material concerning goals, strategies, outlook or other non-historical matters may be forward-looking statements and are based on current indicators and expectations. These forward-looking statements speak only as of the date on which they are made, and the Company undertakes no obligation to update or revise any forward-looking statements. These forward-looking statements are subject to risks and uncertainties that may cause actual results to differ materially from those contained in the statements. The Company and/or its affiliates may or may not have a position in any financial instrument mentioned and may or may not be actively trading in any such securities. Unless stated otherwise all information is provided by the Company. Past performance is not indicative of future results.

Unless stated otherwise this information is communicated by the relevant entity listed below.

Australia: To the extent this material is distributed in Australia it is communicated by Man Investments Australia Limited ABN 47 002 747 480 AFSL 240581, which is regulated by the Australian Securities & Investments Commission ('ASIC'). This information has been prepared without taking into account anyone’s objectives, financial situation or needs.

Austria/Germany/Liechtenstein: To the extent this material is distributed in Austria, Germany and/or Liechtenstein it is communicated by Man (Europe) AG, which is authorised and regulated by the Liechtenstein Financial Market Authority (FMA). Man (Europe) AG is registered in the Principality of Liechtenstein no. FL-0002.420.371-2. Man (Europe) AG is an associated participant in the investor compensation scheme, which is operated by the Deposit Guarantee and Investor Compensation Foundation PCC (FL-0002.039.614-1) and corresponds with EU law. Further information is available on the Foundation's website under www.eas-liechtenstein.li.

European Economic Area: Unless indicated otherwise this material is communicated in the European Economic Area by Man Asset Management (Ireland) Limited (‘MAMIL’) which is registered in Ireland under company number 250493 and has its registered office at 70 Sir John Rogerson's Quay, Grand Canal Dock, Dublin 2, Ireland. MAMIL is authorised and regulated by the Central Bank of Ireland under number C22513.

Hong Kong SAR: To the extent this material is distributed in Hong Kong SAR, this material is communicated by Man Investments (Hong Kong) Limited and has not been reviewed by the Securities and Futures Commission in Hong Kong.

Japan: To the extent this material is distributed in Japan it is communicated by Man Group Japan Limited, Financial Instruments Business Operator, Director of Kanto Local Finance Bureau (Financial instruments firms) No. 624 for the purpose of providing information on investment strategies, investment services, etc. provided by Man Group, and is not a disclosure document based on laws and regulations. This material can only be communicated only to professional investors (i.e. specific investors or institutional investors as defined under Financial Instruments Exchange Law) who may have sufficient knowledge and experience of related risks.

Switzerland: To the extent this material is made available in Switzerland the communicating entity is:

For Clients (as such term is defined in the Swiss Financial Services Act): Man Investments (CH) AG, Huobstrasse 3, 8808 Pfäffikon SZ, Switzerland. Man Investment (CH) AG is regulated by the Swiss Financial Market Supervisory Authority (‘FINMA’); and
For Financial Service Providers (as defined in Art. 3 d. of FINSA, which are not Clients): Man Investments AG, Huobstrasse 3, 8808 Pfäffikon SZ, Switzerland, which is regulated by FINMA.

United Kingdom: Unless indicated otherwise this material is communicated in the United Kingdom by Man Solutions Limited ('MSL') which is a private limited company registered in England and Wales under number 3385362. MSL is authorised and regulated by the UK Financial Conduct Authority (the 'FCA') under number 185637 and has its registered office at Riverbank House, 2 Swan Lane, London, EC4R 3AD, United Kingdom.

United States: To the extent this material is distributed in the United States, it is communicated and distributed by Man Investments, Inc. (‘Man Investments’). Man Investments is registered as a broker-dealer with the SEC and is a member of the Financial Industry Regulatory Authority (‘FINRA’). Man Investments is also a member of the Securities Investor Protection Corporation (‘SIPC’). Man Investments is a wholly owned subsidiary of Man Group plc. The registration and memberships described above in no way imply a certain level of skill or expertise or that the SEC, FINRA or the SIPC have endorsed Man Investments. Man Investments Inc, 1345 Avenue of the Americas, 21st Floor, New York, NY 10105.

This material is proprietary information and may not be reproduced or otherwise disseminated in whole or in part without prior written consent. Any data services and information available from public sources used in the creation of this material are believed to be reliable. However accuracy is not warranted or guaranteed. © Man 2024

ML Meetup: Modeling Reward and Abductive Learning