Skip to Main Content
UNLV Logo

Use of Generative AI in Research: Perplexity and Burstiness

This guide will help you to navigate some of the tools and information needed to consider using generative artificial intelligence (AI) in your research

Overview

While generative AI offers promising capabilities for researchers across disciplines, understanding its nuances like perplexity and burstiness is crucial. These concepts act as tools to evaluate the AI's outputs, ensuring they are insightful, relevant, and unbiased. By being aware of these metrics, scholars can better navigate and leverage the vast potentials of generative AI in their research endeavors.

In addition, understanding these concepts could help scholars identify whether a text is AI-generated or not. While neither perplexity nor burstiness is a foolproof method to identify AI-generated content on its own, they provide valuable tools for discerning readers.

Observing for unexpected combinations of information or repetitive emphasis can offer hints toward the origin of the text. In an era of sophisticated AI, critical reading combined with an awareness of these concepts becomes more important than ever. 

Perplexity

What is it? Perplexity is a measure used to evaluate how well a probability distribution predicts a sample. In the context of generative AI, it quantifies how "surprised" the model is by a given input, based on the data it has been trained on. A lower perplexity indicates that the model is less surprised and thus better at predicting the input. 

How does it relate to AI-generated content? If an AI language model produces a piece of text that seems improbable or unexpected based on its training, the perplexity would be high. For instance, a coherent and grammatically correct text would typically have lower perplexity than a jumbled, nonsensical one.

Why is it important? Imagine you're reading a book and trying to guess the next word in a sentence. If the language and context are familiar, you can often make accurate predictions. Similarly, a language model trained on vast amounts of data uses perplexity to assess how accurately it can predict or understand the next word or piece of data

For researchers, understanding perplexity helps in:

  • Evaluating the quality of AI-generated outputs.
  • Comparing the performance of different models.
  • Assessing how well the model understands a given dataset or subject matter.
  • Deciphering AI-generated content by identifying contextual oddities

Considerations for Researchers:

  • Training Data: If a model is trained on specific genres or disciplines, it might show low perplexity for similar content but high perplexity for unfamiliar subjects.
  • Overfitting: A model with too low perplexity might be overfitted to its training data, meaning it might not generalize well to new, unseen data.

Perplexity Illustrative Examples

High Perplexity (Unexpected and Hard to Predict)

  • Prompt: "The capital of France is..."
  • Output: "a rare fruit named blue apple."
  • This output is unexpected. A model trained on general knowledge would expect the answer to be "Paris". The "blueapple" answer would greatly surprise (or have a high perplexity for) a well-trained model.

Low Perplexity (Expected and Easy to Predict)

  • Prompt: "The capital of France is..."
  • Output: "Paris."
  • This is the expected answer, so a well-trained model would predict this with low perplexity.

Identifying AI-generated Content:

  • Perplexity Clues: Generative AI models aim for coherency and alignment with the patterns they've learned from their training data. When confronted with an output that seems unexpected or nonsensical based on typical human language patterns, it might be indicative of AI generation.

Example of High Perplexity (Possible AI Error)

  • Text: "The Eiffel Tower, known for its role in the American Revolution, stands tall in Berlin."
  • The above sentence mashes together factually incorrect and geographically disparate elements. An AI, mistakenly joining different data points, might generate such an output, leading to high perplexity for any reader familiar with world history and geography.

Example of Low Perplexity (AI Imitating Human-Like Output)

  • Text: "The Eiffel Tower, an iconic symbol of Paris, is one of the most recognized structures in the world."
  • This text aligns well with what we expect based on common knowledge. While it shows an AI model is accurately replicating human-like knowledge, it doesn't necessarily indicate the content isn't AI-generated.

Burstiness

What is it? Burstiness refers to the tendency of certain events or terms to appear in clusters rather than uniformly or randomly distributed. In the context of AI-generated content, it can manifest as repetitive or clustered outputs when you might expect more diverse responses.

Why is it important? Understanding burstiness is essential because it provides insight into:

  • Quality of AI-generated content: Repetitive or too similar content might indicate the model's limited understanding or inherent biases.
  • Data patterns: Recognizing burstiness helps in identifying the patterns or biases present in the training data.

For researchers, grasping the concept of burstiness can aid in:

Detecting anomalies or repetitive patterns in AI-generated outputs.

  • Understanding potential biases in the training data.
  • Ensuring the diversity and quality of results for research applications.
  • Deciphering AI-generated content by identifying:
    • Repetition & Overemphasis: If a text frequently repeats themes, words, or ideas in a way that seems unnatural or overly emphasized, it might be a sign of AI generation.
    • Clustered Information: AI-generated content can sometimes present clustered information, meaning it might delve too deeply into a specific topic or idea, neglecting a more balanced approach.
    • Consistent Themes in Varied Inputs: If, upon giving varied prompts or inputs, the AI tends to veer towards certain themes or terms consistently, it can be an indication of its bursty nature and a hint that the content is AI-generated.

Considerations for Researchers:

  • Training Data: A model trained on data with inherent burstiness or repetition is more likely to produce clustered outputs.
  • Adjusting Parameters: Sometimes, adjusting model parameters or input prompts can help in mitigating burstiness in outputs.
  • Interdisciplinary Awareness: Understanding the norms of term usage in various disciplines can help in distinguishing genuine burstiness from discipline-specific patterns.

Burstiness Illustrative Examples

High Burstiness (Repetitive and Clustered)

  • Prompt: "Describe the ecosystem of a rainforest."
  • Output: "Rainforests are dense, moist, and full of life. Rainforests have dense vegetation. Dense trees are a common sight in rainforests. The density of the rainforests provides shelter to many animals. Because of this density, rainforests are unique."
  • Here, the output excessively focuses on the "density" aspect of rainforests, repetitively using variants of the term "dense", showcasing high burstiness.

Low Burstiness (Diverse and Spread Out)

  • Prompt: "Describe the ecosystem of a rainforest."
  • Output: "Rainforests are vibrant ecosystems teeming with diverse life. From the tall canopy trees to the undergrowth, they are home to countless species of plants, animals, and insects. The humid climate, regular rainfall, and rich soil foster rapid plant growth. This dense vegetation, in turn, provides shelter and sustenance for a plethora of animal species."
  • This output offers a varied and comprehensive view of rainforests without unnecessary repetition or clustering around a single theme.

Identifying AI-generated content:

  • Burstiness Clues:
    • Repetitiveness and overemphasis on certain words or themes can be a sign of AI generation. This might occur due to biases in the training data or the AI's propensity to get "stuck" on certain topics.

Example of High Burstiness (Possible AI Overemphasis)

  • Text: "Cats are popular pets. Cats are often kept in homes. Many people love cats because cats are affectionate. Cats, with their playful nature, make homes lively. It's no wonder cats are loved."
  • The repeated emphasis on "cats" in various contexts within a short span hints at AI generation. A human writer would likely introduce more diversity in phrasing and content.

Example of Low Burstiness (AI Imitating Diverse Human-Like Output)

  • Text: "Cats are popular pets known for their playful nature. They are often loved for being affectionate, and their distinct personalities make each one unique."
  • This content flows more naturally, offering diverse insights about cats without overly repetitive phrasing. However, smooth and diverse content doesn't mean it's not AI-generated; it just means the AI is doing a good job imitating human-like writing.
© University of Nevada Las Vegas