Multimodal Artificial Intelligence and Large Language Models: A Comprehensive Guide from Theory to Practice

Hardback Published on: 29/09/2026
Price: £140
UK delivery included
Not available
This product is currently unavailable
Make and edit your lists in your account
wordery
has a fantastic rating on
Not available
This product is currently unavailable
wordery
has a fantastic rating on

Synopsis

The book provides a comprehensive technical analysis of multimodal artificial intelligence systems and implementation frameworks. It offers thorough coverage of cross-modal processing methods for use, including speech recognition and automatic image captioning. It presents a detailed discussion of architecture for integrating text, image, audio, and video modalities, cross-modal processing pipelines, and data fusion techniques. Showcases real-time synchronization mechanisms across different modalities and scalable design patterns for multimodal systems. Discusses multimodal emotion recognition using deep Learning techniques, focusing on recent advancements, challenges, and ethical considerations. Investigates deployment optimization strategies to address issues with latency, resource usage, and scalability of multimodal systems. Focuses on techniques for performance optimization, memory management, and distributed processing for multimodal workloads using frameworks like PyTorch and TensorFlow. The text is primarily written for senior undergraduates, graduate students, and academic researchers in electrical engineering, electronics and communications engineering, computer science and engineering, and information technology.

Publisher information

  • Publisher: CRC Press
  • ISBN: 9781041152132
  • Number of pages: 376
  • Languages: English