What you’ll learn
By the end of the course, students will be able to build intelligent applications that can see, interpret, and reason over images and documents using different multimodal models and agent-based tools.
Microsoft at Lumify Work
As part of Lumify Group, Lumify Work has skilled more people in Microsoft technologies than any other organisation in Australia and New Zealand. We have a campus in the Philippines, too. We offer the broadest range of instructor-led training courses, from end user to architect level. We are proud to be the winner of the Microsoft MCT Superstars Award for FY24, which formally recognises us as having the highest quality Microsoft Certified Trainers in ANZ.
Who is the course for?
This course is designed for developers, AI engineers, and technical professionals who want to build applications that work with images and documents using multimodal, agent-driven approaches. It’s best suited for learners with basic programming experience and a general understanding of cloud or AI concepts.
Course subjects
During the course, you will be guided through key areas such as:
Understanding multimodal AI models that process images, documents, and text together.
Extracting structured information from visual and document inputs using AI-driven techniques.
Applying agent-based tools to orchestrate workflows that involve visual and language model reasoning.
Designing intelligent applications that interpret, analyse, and make decisions using visual data.
Implementing practical patterns for grounding model responses in images and documents.
Prerequisites
To be successful in this course, learners should have the following:
Basic programming experience to comfortably follow examples and implement simple multimodal or agent‑based workflows.
A general understanding of cloud or AI concepts, such as how models are used in applications or how cloud services support AI workloads.
An interest in working with images and documents as inputs for intelligent applications.
A willingness to explore multimodal AI approaches, including how visual and language models can be combined for reasoning and analysis.
FREE E-BOOK: The New Era of Cloud Computing
We've created this e-book to assist you on your cloud journey, from defining the optimal cloud infrastructure and choosing a cloud platform, to security in the cloud and the core challenges in moving to the cloud.
Terms & Conditions
The supply of this course by Lumify Work is governed by the booking terms and conditions. Please read the terms and conditions carefully before enrolling in this course, as enrolment in the course is conditional on acceptance of these terms and conditions.