Applications and future directions in multimodal large language model: opportunities and challenges

Title: Applications and future directions in multimodal large language model: opportunities and challenges
Creator: Chandra, J.; Malviya, Meenakshi; Sabu, Samson; Rajendran, Rajesh Kanna; Joseph, Alwin
Description: Multimodal large language models (MLLMs) are an application of artificial intelligence that is rapidly growing by integrating numerous use cases. MLLMs have the capability to process data from several sources, including structured and unstructured. It enables large language models (LLMs) to give insights to the user by analyzing data from various formats. The traditional way of analyzing the data was done with a single data format. While using MLLMs, multiple data modalities are handled to manage complicated multimodal tasks, like generating content, multimodal perception, and augmenting human-computer interaction. This chapter discusses in detail the insights from data from multiple modalities and domains of the use cases of MLLMs. We also discuss the advantages of MLLMs and explain the transformative benefits from unimodal systems to multimodal systems in different sectors. We also focus on the ethical usage of MLLMs by addressing the challenges related to privacy, operational limits, bias, computational difficulties, and data scarcity. The scarce assessment metrics and trials in accomplishing robust explainability are also discussed. To train these MLLMs, acquiring and training the data utilizes more computation and power consumption, along with addressing the data security and privacy concerns. The chapter also discusses the sensible usage of AI through different problems and practices. Detailed analysis and strategies for addressing global challenges and promoting novelty in model development, outlining how these MLLMs shape the upcoming technological innovation focusing on ethical application of technology with an advantage on society. 2026 Elsevier Inc. All rights reserved.
Source: Challenges and Applications of Generative Large Language Models;pp.219-241
Date: 01-01-2026
Publisher: Elsevier
Subject: automation; customer care; healthcare assistant; information management; machine learning; Multimodal large language models
Coverage: Chandra J., Department of Computer Science, Christ University, Karnataka, Bengaluru, India; Malviya M., Department of Computer Science, Christ University, Karnataka, Bengaluru, India; Sabu S., Department of Computer Science, Christ University, Karnataka, Bengaluru, India; Rajendran R.K., Department of Computer Science, Christ University, Karnataka, Bengaluru, India; Joseph A., Department of Computer Science, Christ University, Karnataka, Bengaluru, India
Rights: Restricted Access; Hardcopy may be available in the library
Relation: ISBN: 978-044333592-1; 978-044333593-8;
Format: online
Language: English
Type: Book chapter
Identifier: https://doi.org/10.1016/B978-0-443-33592-1.00006-1

https://www.scopus.com/pages/publications/105032983757?origin=resultslist

Collection

Citation

Chandra, J.; Malviya, Meenakshi; Sabu, Samson; Rajendran, Rajesh Kanna; Joseph, Alwin, “Applications and future directions in multimodal large language model: opportunities and challenges,” CHRIST (Deemed To Be University) Institutional Repository, accessed June 18, 2026, https://archives.christuniversity.in/items/show/24232.

Applications and future directions in multimodal large language model: opportunities and challenges

Collection

Citation

Output Formats