Machine Learning Driven Developments in Behavioral Annotation: A Recent Historical Review

Watson, Eleanor; Viana, Thiago; Zhang, Shujun

Watson, Eleanor, Viana, Thiago ORCID: https://orcid.org/0000-0001-9380-4611 and Zhang, Shujun ORCID: https://orcid.org/0000-0001-5699-2676 (2024) Machine Learning Driven Developments in Behavioral Annotation: A Recent Historical Review. International Journal of Social Robotics, 16. pp. 1605-1618. doi:10.1007/s12369-024-01117-1

13773 Watson et al (2024) Machine Learning Driven Developments - AAM.pdf - Accepted Version
Available under License Publisher's Licence.

Official URL: https://doi.org/10.1007/s12369-024-01117-1

Abstract

Annotation tools serve a critical role in the generation of datasets that fuel machine learning applications. With the advent of Foundation Models, particularly those based on Transformer architectures and expansive language models, the capacity for training on comprehensive, multimodal datasets has been substantially enhanced. This not only facilitates robust generalization across diverse data categories and knowledge domains but also necessitates a novel form of annotation—prompt engineering—for qualitative model finetuning. This advancement creates new avenues for machine intelligence to more precisely identify, forecast, and replicate human behavior, addressing historical limitations that contribute to algorithmic inequities. Nevertheless, the voluminous and intricate nature of the data essential for training multimodal models poses significant engineering challenges, particularly with regard to bias. No consensus has yet emerged on optimal procedures for conducting this annotation work in a manner that is ethically responsible, secure, and efficient. This historical literature review traces advancements in these technologies from 2018 onward, underscores significant contributions, and identifies existing knowledge gaps and avenues for future research pertinent to the development of Transformer-based multimodal Foundation Models. An initial survey of over 724 articles yielded 156 studies that met the criteria for historical analysis; these were further narrowed down to 46 key papers spanning the years 2018-2022. The review offers valuable perspectives on the evolution of best practices, pinpoints current knowledge deficiencies, and suggests potential directions for future research. The paper includes six figures and delves into the transformation of research landscapes in the realm of machine-assisted behavioral annotation, focusing on critical issues such as bias.

Item Type:	Article
Article Type:	Article
Uncontrolled Keywords:	Annotation; Behavior; Foundation models; LLMs; Machine learning; Robotics; Social
Subjects:	Q Science > QA Mathematics > QA76 Computer software
Divisions:	Schools and Research Institutes > School of Business, Computing and Social Sciences
Research Priority Areas:	Applied Business & Technology
Depositing User:	Susan Turner
Date Deposited:	21 Mar 2024 12:33
Last Modified:	10 Oct 2025 18:00
URI:	https://eprints.glos.ac.uk/id/eprint/13773
