الاستماع والقراءة

خطوة إلى عالم لا حدود له من القصص

  • اقرأ واستمع إلى ما تريده
  • أكثر من مليون عنوان
  • العناوين الحصرية + أصول القصة
  • 7 يوم تجربة مجانية، ثم 9.99$ يورو في الشهر
  • من السهل الإلغاء في أي وقت
جرب مجانا
Details page - Device banner - 894x1036
Cover for Ultimate Multimodal Transformer Models

Ultimate Multimodal Transformer Models

اللغة
الإنجليزية
الصيغة
تصنيف

كتب واقعية

One Architecture. Infinite Intelligence.

Book Description

Transformer architectures have become the unified foundation of modern AI — powering language models, computer vision systems, and multimodal applications that process text, images, and speech together. Ultimate Multimodal Transformer Models provides a comprehensive, hands-on guide to mastering every major Transformer variant, from foundational encoder-decoder architectures to cutting-edge vision-language models and production GenAI systems.

You begin with the core building blocks of Transformer architecture and text data preparation, then progressively advance through encoder-only models, generative LLMs, RAG, Agentic workflows, and efficient fine-tuning using PEFT, LoRA, and QLoRA. The book then transitions into Vision Transformers, covering ViT, DETR, SAM, CLIP, and Flamingo, before bringing everything together in real-world multimodal applications combining text, vision, and speech using PyTorch and Hugging Face throughout.

By the end of the book, you will be proficient to build, fine-tune, and deploy Transformer-based AI systems across text, vision, and multimodal domains with confidence, applying the right architecture and strategy for every real-world use case!

What you will learn

? Build and deploy Transformer models for text, vision, and multimodal AI tasks.

? Fine-tune large language models efficiently using PEFT, LoRA, and QLoRA techniques.

? Develop production-ready GenAI applications using RAG pipelines and Agentic AI workflows.

? Apply LLMs to real-world NLP tasks including summarization, question answering, and classification.

? Implement Vision Transformers, DETR, and SAM for object detection and image segmentation tasks.

? Integrate multimodal AI systems combining text, vision, and speech using CLIP and Flamingo architectures.

Table of Contents

1. The Rise of Transformer Models in Sequence Learning

2. Text Data Preparation for Transformer Models

3. Building Blocks of Transformer Architecture

4. Encoder-only Transformer Configurations

5. Generative Transformers and LLM Architectures

6. Customizing LLMs Using Retrieval-Augmented Generation (RAG)

7. Efficient Fine-Tuning Techniques with PEFT and LoRA

8. Orchestrating LLMs with Tools and Memory

9. Introduction to Vision Transformer Models

10. Vision Transformers for Image Classification

11. Object Detection and Segmentation with Transformer Architectures

12. Vision-Language Models and Multimodal LLMs

13. Real-World Multimodal GenAI Applications

14. Image Generation with Vision Transformers

15. The Future of GenAI with Transformers

Index

© 2026 Orange Education Pvt Ltd (كتاب إلكتروني): 9788169646833

تاريخ النشر

كتاب إلكتروني: 2 يونيو 2026

  1. A Country Doctor
    A Country Doctor Sarah Orne Jewett
    2.8
  2. PMP Pro: Transform Your Exam Success with Game-Changing Secrets: "Elevate your PMP exam results! Dive into transformative audio lessons for peak performance on test day."
    PMP Pro: Transform Your Exam Success with Game-Changing Secrets: "Elevate your PMP exam results! Dive into transformative audio lessons for peak performance on test day." Arden Blakewood
    0
  3. Desconexión Digital: Meditaciones Guiadas para Calma y Claridad
    Desconexión Digital: Meditaciones Guiadas para Calma y Claridad Refeser
    5
  4. Nature’s Symphony of Serene Forest Cricket Sounds Mixed With Piano Rhythms For Deep Calm & Relaxation: Experience Soothing Nights for Restful Sleep & Mindfulness Using Enhanced BGM 8D Audio
    Nature’s Symphony of Serene Forest Cricket Sounds Mixed With Piano Rhythms For Deep Calm & Relaxation: Experience Soothing Nights for Restful Sleep & Mindfulness Using Enhanced BGM 8D Audio Cedar Skye
    5
  5. GED Secrets: Elevate Your Success and Conquer the Exam Today: "Boost your GED prep! Unlock engaging audio lessons for ultimate exam success today!"
    GED Secrets: Elevate Your Success and Conquer the Exam Today: "Boost your GED prep! Unlock engaging audio lessons for ultimate exam success today!" Ronan Cade
    1
  6. Nature’s Symphony Of Soothing Lake Soundscapes For Meditation, Deep Relaxation & Stress Relief: Embrace The Harmony & Feel The Water Waves With Blissful 8d Audio For Inner Peace & Serenity
    Nature’s Symphony Of Soothing Lake Soundscapes For Meditation, Deep Relaxation & Stress Relief: Embrace The Harmony & Feel The Water Waves With Blissful 8d Audio For Inner Peace & Serenity Cedar Skye
    0
  7. The Complete Falconer Files Brief Cases Books 1 - 8
    The Complete Falconer Files Brief Cases Books 1 - 8 Andrea Frazer
    4
  8. For All Time: First in the Liza Marchant Series
    For All Time: First in the Liza Marchant Series Marian L Jasper
    0
  9. Data-Driven Decisions: Mastering Business Data Science
    Data-Driven Decisions: Mastering Business Data Science Chuck Sherman
    4
  10. Nature’s Symphony of Tranquil Forest Soundscapes Using Enhanced 8D Audio For A More Natural Relaxation: Meditation Aid For Unmatched Calm, Emotional Healing, Mental Clarity & Stress Relief
    Nature’s Symphony of Tranquil Forest Soundscapes Using Enhanced 8D Audio For A More Natural Relaxation: Meditation Aid For Unmatched Calm, Emotional Healing, Mental Clarity & Stress Relief Cedar Skye
    0
  11. Notas sobre Enfermería: (Español latino)
    Notas sobre Enfermería: (Español latino) Florence Nightingale
    0
  12. SHRM: Your HR Exam Success with Insider Secrets: "Boost your HR exam readiness! Harness powerful audio lessons packed with insider tips for ultimate success."
    SHRM: Your HR Exam Success with Insider Secrets: "Boost your HR exam readiness! Harness powerful audio lessons packed with insider tips for ultimate success." Ronan Ashwood
    0
  13. NONPARTICIPANT: A New Dawn
    NONPARTICIPANT: A New Dawn Rob Johnson
    1
  14. Intermediate Japanese: Broaden Your Japanese Vocabulary and Cultural Understanding
    Intermediate Japanese: Broaden Your Japanese Vocabulary and Cultural Understanding Yuki Abe
    0
  15. Ne, ne, ne, was hab ich bloß?: Humorvolle Geschichten über den menschlichen Körper
    Ne, ne, ne, was hab ich bloß?: Humorvolle Geschichten über den menschlichen Körper Edla Pinnow
    5

دائمًا برفقة Storytel

  • أكثر من 200000 عنوان

  • وضع الأطفال (بيئة آمنة للأطفال)

  • تنزيل الكتب للوصول إليها دون الاتصال بالإنترنت

  • الإلغاء في أي وقت

الكتب الأكثر استماعًا

شهري

قصص لكل المناسبات.

$9.99 /شهر

  • 1 حساب

  • استماع بلا حدود

  • إلغاء في أي وقت

جرب الآن

سنويا

قصص لكل المناسبات.

$83.88 /سنة

وفر 30%
  • 1 حساب

  • استماع بلا حدود

  • إلغاء في أي وقت

جرب الآن

6 أشهر

قصص لكل المناسبات.

$53.64 /6 أشهر

وفر 11%
  • 1 حساب

  • استماع بلا حدود

  • إلغاء في أي وقت

جرب الآن