Pass4Future also provides interactive practice exam software for preparing effectively for the Microsoft AI Transformation Leader (AB-731) exam. You are welcome to explore the free sample Microsoft AB-731 exam questions below and to try the Microsoft AB-731 practice test software.
Did you know that you can access more real Microsoft AB-731 exam questions via Premium Access?
Your company plans to implement a proof of concept (PoC) agent that uses Azure OpenAI. The solution must start small and provide flexibility to scale usage as demand grows. Which pricing model should you use?
Answer : D
For a proof of concept, the key requirements are low commitment, quick start, and the ability to scale up or down as you learn what real usage looks like. Azure OpenAI Standard On-Demand pricing is designed for exactly that: you pay per token consumed (input and output) on a pay-as-you-go basis, which makes it ideal when demand is uncertain or variable---typical in early pilots and PoCs.
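The cost behavior described above can be made concrete with a small estimator. This is a minimal sketch only: the per-1K-token rates below are hypothetical placeholders, not real Azure OpenAI prices, and the function name is invented for illustration.

```python
# Sketch: how Standard (pay-as-you-go) cost scales for a PoC.
# NOTE: the per-1K-token rates are HYPOTHETICAL, not actual Azure pricing.

def estimate_monthly_cost(requests_per_day: int,
                          avg_input_tokens: int,
                          avg_output_tokens: int,
                          input_rate_per_1k: float = 0.003,   # assumed rate
                          output_rate_per_1k: float = 0.006,  # assumed rate
                          days: int = 30) -> float:
    """Pay-as-you-go: cost scales linearly with tokens actually consumed."""
    daily = (requests_per_day * avg_input_tokens / 1000 * input_rate_per_1k
             + requests_per_day * avg_output_tokens / 1000 * output_rate_per_1k)
    return round(daily * days, 2)

# A small pilot costs little, and scaling traffic 10x scales cost 10x,
# with no up-front capacity reservation (unlike PTUs).
small = estimate_monthly_cost(200, 500, 300)    # 19.8
large = estimate_monthly_cost(2000, 500, 300)   # 198.0
```

The linear relationship is the point: with On-Demand pricing there is no step cost when the PoC grows, and no sunk reservation when it winds down.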
By contrast, Provisioned (PTUs) is best when you have well-defined, predictable throughput and latency requirements---usually a more mature, production workload. PTUs involve reserving model processing capacity to achieve consistent performance and more predictable costs, which is usually premature for a PoC where actual traffic patterns are not yet known.
Batch API is optimized for asynchronous high-volume jobs with a target turnaround (for example, up to 24 hours) and discounted pricing. That's great for offline processing, but it does not match an interactive "agent" PoC that typically needs near-real-time responses and iterative testing.
Microsoft 365 Copilot is a separate SaaS licensing model and is not the Azure OpenAI pricing model for building your own agent solution.
You need to recommend a service that supports indexing information and knowledge mining by extracting insights from documents. What should you recommend?
Answer : D
The requirement has two key phrases: indexing information and knowledge mining by extracting insights from documents. The Microsoft service purpose-built for this is Azure AI Search (formerly Azure Cognitive Search), which provides a search index over your content and supports "AI enrichment" workflows to extract and structure insights from documents during indexing.
Azure AI Search can ingest content from common enterprise sources (files, blobs, databases), build searchable indexes, and enrich the indexed content using built-in skills or integrated AI capabilities---such as entity recognition, key phrase extraction, language detection, and OCR (depending on the pipeline). This is exactly what "knowledge mining" refers to: turning large volumes of unstructured documents into structured, searchable knowledge that applications and users can query.
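The ingest-enrich-index pattern can be sketched in miniature. This toy example stands in for the real pipeline: the "skill" here (capitalized-term extraction) is a simplified assumption standing in for built-in skills like entity recognition, and the inverted index stands in for an Azure AI Search index.

```python
# Toy sketch of the knowledge-mining pattern: ingest documents, run an
# enrichment "skill" during indexing, and serve queries over the result.
import re
from collections import defaultdict

def extract_entities(text: str) -> list[str]:
    """Stand-in enrichment skill: treat capitalized terms as 'entities'."""
    return re.findall(r"\b[A-Z][a-z]+\b", text)

def build_index(docs: dict[str, str]) -> dict[str, set[str]]:
    """Inverted index (term -> doc ids) over raw text plus enrichments."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        terms = text.lower().split() + [e.lower() for e in extract_entities(text)]
        for term in terms:
            index[term].add(doc_id)
    return index

docs = {
    "d1": "Contoso signed a supply agreement with Fabrikam in March.",
    "d2": "The warehouse inventory report covers paper and toner.",
}
index = build_index(docs)
# Querying the enriched index surfaces the document mentioning Fabrikam.
# index["fabrikam"] -> {"d1"}
```

The real service does the same thing at scale: enrichment output is written into the index at ingestion time, so queries can match on extracted structure, not just raw text.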
The other choices are partial fits: Azure Vision focuses on image/video analysis, not general document indexing. Azure Document Intelligence is excellent for extracting fields/tables from forms and documents, but on its own it does not provide the full indexing/search and knowledge mining layer across a corpus. Microsoft Foundry is an overarching platform for building AI apps/agents; it can incorporate search, but the specific service that directly delivers indexing + knowledge mining is Azure AI Search.
Which statement accurately describes the difference between a pretrained generative AI model and a fine-tuned generative AI model?
Answer : C
A pretrained generative AI model is trained initially on a large, broad, and diverse dataset so it learns general language (or multimodal) patterns and capabilities. Fine-tuning then takes that pretrained base and performs additional training on a smaller, task- or domain-specific dataset to specialize behavior---improving performance for a particular use case, tone, style, or domain knowledge representation. That is exactly what option C states, making it the correct answer.
Option A is incorrect because both pretraining and fine-tuning may use labeled or unlabeled data depending on the technique; the distinction is not ''labeled vs. unlabeled.'' Option B is incorrect because a pretrained model is not ''faster to train'' due to fewer parameters; pretraining is typically the most compute-intensive phase precisely because it's done at large scale, while fine-tuning is smaller but still trains the same model architecture. Option D is reversed: the pretrained model is the general-purpose foundation, while the fine-tuned model is the specialized variant for a specific task or dataset.
Your company manages an online catalog of office supplies. You plan to use a generative AI solution to create product descriptions for your company's website. The solution must ensure descriptions can be posted immediately after creation, enable selection/inclusion of product details, and be fast and simple for non-technical staff. What is the best type of solution to use? Select the BEST answer.
Answer : D
The task is high-volume content generation with consistent structure and immediate publishing: product descriptions that reliably include chosen product attributes (brand, specs, materials, dimensions, use cases) and can be produced quickly by non-technical staff. The best fit is a fine-tuned LLM (D) because fine-tuning can standardize tone, format, and completeness against your catalog schema, reducing variability and minimizing manual editing before posting. With a fine-tuned model, you can strongly enforce style guidelines (length, voice, prohibited claims), and you can template prompts so staff only supply product fields and get publish-ready copy.
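The "staff only supply product fields" workflow can be sketched as a prompt template over structured catalog data. The template wording and field names below are illustrative assumptions, not a prescribed schema.

```python
# Sketch: non-technical staff fill in structured fields; a template turns
# them into a request for the fine-tuned model. Field names are assumptions.

TEMPLATE = (
    "Write a product description for the catalog.\n"
    "Brand: {brand}\nProduct: {name}\nMaterial: {material}\n"
    "Dimensions: {dimensions}\nKey uses: {uses}\n"
    "Style: friendly, under 80 words, no unverified claims."
)

def build_prompt(fields: dict[str, str]) -> str:
    """Only the selected product details reach the model, which enforces
    inclusion of the chosen attributes in every generation request."""
    return TEMPLATE.format(**fields)

prompt = build_prompt({
    "brand": "Contoso",
    "name": "Gel Pen 5-Pack",
    "material": "Recycled plastic",
    "dimensions": "14 cm",
    "uses": "note-taking, journaling",
})
```

Because the template is fixed and the model is fine-tuned on the house style, staff never touch the prompt itself; they fill fields and receive publish-ready copy.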
Option A is not best: Azure Machine Learning is excellent for predictive models but is unnecessary for straightforward text generation. B (Researcher) is optimized for multistep research across work data + web, not deterministic product copy generation. C (interactive agent) can help collect requirements, but it's more complexity than needed; the core need is consistent text generation from structured product data, which fine-tuning addresses directly while keeping user interaction simple (fill in the fields, get a description).
Which business requirement most closely relates to grounding a generative AI model?
Answer : D
Grounding in generative AI means ensuring model outputs are based on trusted, relevant information sources rather than only on the model's general training data. In a business context, grounding is about aligning responses with verified enterprise knowledge (policies, product documentation, internal procedures, approved FAQs, etc.) so the system is more accurate, consistent, and defensible. That is exactly what option D describes: "ensuring that verified company data sources are used for response generation."
In Microsoft AI solution patterns, grounding is commonly achieved using retrieval-augmented generation (RAG). With RAG, the system retrieves relevant passages from approved company repositories (for example, indexed documents or knowledge bases) and supplies them as context to the model during response generation. This reduces hallucinations, improves factual correctness, and makes answers more relevant to the organization's reality---critical when AI is used for customer support, employee helpdesks, compliance guidance, or executive reporting.
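The retrieve-then-generate loop can be sketched in a few lines. This is a deliberately simplified assumption: term-overlap scoring stands in for a real retriever such as Azure AI Search, and the passages and prompt wording are invented for illustration.

```python
# Minimal RAG grounding sketch: retrieve the most relevant approved passage,
# then supply it as context so the answer is anchored to company data.

KNOWLEDGE_BASE = [  # stand-in for an indexed, approved repository
    "Refunds are issued within 14 days of an approved return request.",
    "Standard shipping takes 3-5 business days within the country.",
]

def retrieve(question: str) -> str:
    """Naive retriever: pick the passage sharing the most terms."""
    q_terms = set(question.lower().split())
    return max(KNOWLEDGE_BASE,
               key=lambda p: len(q_terms & set(p.lower().split())))

def grounded_prompt(question: str) -> str:
    """Anchor generation to retrieved context instead of model memory."""
    context = retrieve(question)
    return (f"Answer using ONLY the context below.\n"
            f"Context: {context}\nQuestion: {question}")

prompt = grounded_prompt("How long do refunds take?")
```

The instruction to answer only from the supplied context is what constrains the model to verified company data; in production the retriever and the instruction framing are both hardened well beyond this sketch.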
The other options do not directly address grounding. A relates to localization/multilingual capability, B is a usage/telemetry metric, and C is an interaction method (natural language interface). They can all be important requirements, but none of them ensure outputs are anchored to verified company data---the core purpose of grounding.