v0.23.x

© 2025 Distributional, Inc. All Rights Reserved.

LLM Text Summarization

In this advanced tutorial, we demonstrate how to use dbnl to automatically evaluate the consistency of summarization output on a fixed set of documents.

The data files required for this tutorial are available in the archives listed at the end of this page.

Applications

While summarization is the focus of this tutorial, the same principles apply to any task involving text generation. In each case, the goal is to evaluate the consistency of the generated text with the input text. Other such tasks include entity recognition, question answering, and machine translation.
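
As a rough illustration of what "consistency of the generated text with the input text" can mean in practice, the sketch below scores a summary by the fraction of its word tokens that also appear in the source document. This is a deliberately simple stand-in written for this page, not a metric provided by dbnl; the function names `tokenize` and `source_overlap` and the example sentences are hypothetical.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def source_overlap(summary: str, source: str) -> float:
    """Fraction of distinct summary tokens that also occur in the source.

    Hypothetical illustration only: a crude proxy for how closely a
    generated text stays to its input. Returns 0.0 for an empty summary.
    """
    summary_tokens = tokenize(summary)
    if not summary_tokens:
        return 0.0
    shared = summary_tokens & tokenize(source)
    return len(shared) / len(summary_tokens)


# Made-up example texts, not taken from the tutorial data files.
source = "The cat sat on the mat while the dog slept."
faithful = "The cat sat on the mat."
drifting = "The bird flew over the house."

faithful_score = source_overlap(faithful, source)
drifting_score = source_overlap(drifting, source)
```

A per-document score of this kind, computed for every document in a fixed set, yields a distribution of values that could be compared from one run of an application to the next.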

Prerequisites

This tutorial assumes that you have already completed the Quickstart tutorial and, ideally, the Trading Strategy tutorial.

Organization

This tutorial requires a good deal of preparation, so it has been divided into the following four sections:

  • Defining the text summarization problem of interest, including the data source and the metrics,

  • Creating a constrained optimization problem to govern the development of a text summarization app in dbnl,

  • Managing the integration testing process for consistent testing after such an app has been created,

  • Reviewing practical considerations that arise when actually building an LLM summarization tool.

  • Summarization Tutorial files: summarization-2025-02-05-18-01-59.zip (archive, 9MB)

  • Prompt Engineering files: prompt-engineering-2024-11-18-18-01-59.zip (archive, 8MB)