Aspose.Slides Presentation Text Extractor for .NET

Aspose.Slides Presentation Text Extractor for .NET allows developers to extract text from Microsoft PowerPoint presentations programmatically, including text from slides, masters, layouts, notes, and comments.

It supports both arranged text extraction, preserving visual reading order, and unarranged extraction for raw content processing.

Installation and Setup

  1. Install the NuGet package Aspose.Slides.NET.
  2. Apply a license at startup to remove evaluation limitations (see Licensing).
  3. Check the supported platforms (see .NET System Requirements).
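
As a minimal sketch of step 2, the license can be applied once before any other Aspose.Slides call. The class name LicenseSetup and the file name "Aspose.Slides.lic" are placeholders chosen for illustration; substitute the path to your own license file.

```csharp
using Aspose.Slides;

// Hypothetical helper class; only License and SetLicense come from Aspose.Slides.
public static class LicenseSetup
{
    // Call once at application startup, before loading any presentation.
    public static void Apply()
    {
        var license = new License();

        // Placeholder path: point this at your actual license file.
        license.SetLicense("Aspose.Slides.lic");
    }
}
```

Calling LicenseSetup.Apply() at the top of Main (or the equivalent startup hook) keeps licensing in one place.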

Supported Platforms

  • Windows, Linux, macOS
  • .NET 6+, .NET Framework 4.0+, Mono
  • x86, x64, ARM64 architectures

Extract Raw Text from Presentation

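The Presentation Text Extractor for .NET plugin extracts text from the input Microsoft PowerPoint presentation in the mode you specify: TextExtractionArrangingMode.Unarranged returns the raw text, and TextExtractionArrangingMode.Arranged returns the text in the same order as it appears on the slide.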

using System;
using Aspose.Slides;

// Extract the raw text from the PPTX presentation (Unarranged mode)
var presentationText = PresentationFactory.Instance.GetPresentationText("pres.pptx", TextExtractionArrangingMode.Unarranged);

foreach (var slideText in presentationText.SlidesText)
{
    Console.WriteLine(slideText.Text);         // text on the slide itself
    Console.WriteLine(slideText.MasterText);   // text from the master slide
    Console.WriteLine(slideText.LayoutText);   // text from the layout slide
    Console.WriteLine(slideText.NotesText);    // text from the notes page
    Console.WriteLine(slideText.CommentsText); // text from the comments
}

// Extract text in the same order as it appears on the slide (Arranged mode).
// A separate variable name avoids redeclaring presentationText in the same scope.
var arrangedPresentationText = PresentationFactory.Instance.GetPresentationText("pres.ppt", TextExtractionArrangingMode.Arranged);

foreach (var slideText in arrangedPresentationText.SlidesText)
{
    Console.WriteLine(slideText.Text);
}

Features

  • Extract text from:
    • Slide bodies
    • Master slides
    • Layout slides
    • Notes pages
    • Comments
  • Option to preserve text order
  • Automation-friendly for content processing
  • No dependency on Microsoft Office

Common Use Cases

  • Search indexing of presentation content
  • Document analysis and compliance scanning
  • Content extraction for AI and language processing
  • Notes and comments mining for reporting systems
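
To illustrate the search-indexing use case, here is a small sketch of an inverted index built from per-slide text. SlideTextIndex and Build are hypothetical names and are not part of Aspose.Slides; the input is assumed to be the slideText.Text values collected from presentationText.SlidesText, in slide order. The word splitting is deliberately naive and would need replacing for real search workloads.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of Aspose.Slides.
public static class SlideTextIndex
{
    // Maps each word to the 1-based numbers of the slides that contain it.
    // Lookups are case-insensitive because of the comparer passed to the dictionary.
    public static Dictionary<string, SortedSet<int>> Build(IList<string> slideTexts)
    {
        var index = new Dictionary<string, SortedSet<int>>(StringComparer.OrdinalIgnoreCase);
        var separators = new[] { ' ', '\t', '\r', '\n', '.', ',', ';', ':', '!', '?', '(', ')', '"' };

        for (int i = 0; i < slideTexts.Count; i++)
        {
            // Treat a missing slide text as empty rather than failing.
            var text = slideTexts[i] ?? string.Empty;

            foreach (var word in text.Split(separators, StringSplitOptions.RemoveEmptyEntries))
            {
                SortedSet<int> slides;
                if (!index.TryGetValue(word, out slides))
                {
                    slides = new SortedSet<int>();
                    index[word] = slides;
                }
                slides.Add(i + 1);
            }
        }

        return index;
    }
}
```

With the index built, a lookup such as index["revenue"] returns the set of slide numbers that mention the word, which is enough to drive a simple "find in presentation" feature.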

FAQ

Does it extract hidden text?
Hidden content can be included based on object properties.

Are formatting details extracted?
This API extracts text only, not styling.

Does extraction support encrypted files?
Yes, if a valid password is supplied.
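
One way to supply the password is through LoadOptions, as in the sketch below. It assumes that your Aspose.Slides version provides the GetPresentationText overload that takes a stream together with load options; check the API reference for your release. The file name "protected.pptx" and the password value are placeholders.

```csharp
using System;
using System.IO;
using Aspose.Slides;

public static class ProtectedTextExtraction
{
    public static void Main()
    {
        // Placeholder password: replace with the real one for your file.
        var loadOptions = new LoadOptions { Password = "my-password" };

        // Placeholder file name. Assumes the stream-based overload that
        // accepts load options exists in the installed version.
        using (var stream = File.OpenRead("protected.pptx"))
        {
            var presentationText = PresentationFactory.Instance.GetPresentationText(
                stream, TextExtractionArrangingMode.Unarranged, loadOptions);

            foreach (var slideText in presentationText.SlidesText)
            {
                Console.WriteLine(slideText.Text);
            }
        }
    }
}
```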