1. IBM watsonx.ai Documentation: The official documentation for tuning foundation models states: "If you want output that is more predictable and focused, try a lower temperature. If you want more variety, try a higher temperature." This supports decreasing the temperature to address verbose and unfocused (repetitive) output.
Source: IBM Cloud Docs, "Tuning foundation models", section "Decoding parameters".
2. Stanford University Courseware (CS224N): Lecture materials on sequence generation explain that temperature scales the logits before the softmax function is applied. A lower temperature sharpens the probability distribution, favoring the most likely tokens and producing less random, more focused text; a higher temperature flattens the distribution, increasing randomness.
Source: Stanford CS224N: NLP with Deep Learning, lecture notes on sequence models and generation.
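A minimal sketch of the scaling the lecture materials describe, using made-up logits for four candidate tokens (the function name and values are illustrative only):

```python
import numpy as np

def softmax_with_temperature(logits, temperature=1.0):
    """Divide logits by the temperature before applying the softmax (toy example)."""
    scaled = np.asarray(logits, dtype=float) / temperature
    scaled -= scaled.max()            # subtract max for numerical stability
    exp = np.exp(scaled)
    return exp / exp.sum()

logits = [2.0, 1.0, 0.5, -1.0]        # made-up logits for four candidate tokens

for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, temperature=t)
    print(f"T={t}: {np.round(probs, 3)}")
# Lower T concentrates probability on the top token (sharper distribution);
# higher T spreads probability more evenly (flatter distribution).
```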
3. Academic Publication on Text Generation: The paper "The Curious Case of Neural Text Degeneration" discusses decoding strategies. It notes that high-likelihood, human-like text occupies a specific region of the probability distribution, avoiding both the repetitive output of greedy decoding (the low-temperature limit) and the incoherence of highly random sampling (the high-temperature limit). Lowering the temperature from a high value moves the output toward this desired range.
Source: Holtzman, A., et al. (2019). "The Curious Case of Neural Text Degeneration". arXiv preprint arXiv:1904.09751. (Section 3 discusses sampling methods and the effect of temperature.)
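To illustrate the two extremes the paper contrasts, here is a toy sampling sketch with an invented five-token vocabulary and made-up logits; in the low-temperature limit sampling collapses toward greedy decoding (the same token repeats), while a high temperature spreads draws across unlikely tokens:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_tokens(logits, temperature, n=20):
    """Draw n tokens from a temperature-scaled softmax over a toy vocabulary."""
    scaled = np.asarray(logits, dtype=float) / temperature
    scaled -= scaled.max()
    probs = np.exp(scaled) / np.exp(scaled).sum()
    return rng.choice(len(logits), size=n, p=probs)

logits = [3.0, 1.5, 1.0, 0.5, 0.0]    # made-up logits for a 5-token vocabulary

print("T=0.1:", sample_tokens(logits, 0.1))   # near-greedy: almost always token 0 (repetitive)
print("T=2.0:", sample_tokens(logits, 2.0))   # near-uniform: many different tokens (incoherent)
print("T=0.7:", sample_tokens(logits, 0.7))   # intermediate: mostly likely tokens, some variety
```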