CmsExtractorRtf (OpenCms Core API, version 7.5.3)

org.opencms.search.extractors
Class CmsExtractorRtf

java.lang.Object
  org.opencms.search.extractors.A_CmsTextExtractor
      org.opencms.search.extractors.CmsExtractorRtf

All Implemented Interfaces: I_CmsTextExtractor

public final class CmsExtractorRtf
extends A_CmsTextExtractor

Extracts the text from an RTF document.

Since: 6.0.0
Version: $Revision: 1.14 $
Author: Alexander Kandzior

Field Summary

Fields inherited from class org.opencms.search.extractors.A_CmsTextExtractor
`m_inputBuffer`

Method Summary
`I_CmsExtractionResult`	`extractText(byte[] content, java.lang.String encoding)` Extracts the text and meta information from the given binary document, using the specified content encoding.
`static I_CmsTextExtractor`	`getExtractor()` Returns an instance of this text extractor.

Methods inherited from class org.opencms.search.extractors.A_CmsTextExtractor
`combineContentItem, extractText, extractText, extractText, getStreamCopy, removeControlChars`

Methods inherited from class java.lang.Object
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait`

Method Detail

getExtractor

public static I_CmsTextExtractor getExtractor()

Returns an instance of this text extractor.

Returns: an instance of this text extractor

extractText

public I_CmsExtractionResult extractText(byte[] content,
                                         java.lang.String encoding)
                                  throws java.lang.Exception

Description copied from interface: I_CmsTextExtractor

Extracts the text and meta information from the given binary document, using the specified content encoding.

The encoding is a hint for the text extractor. If the value given is null, the text extractor should try to determine the encoding itself.

Specified by: extractText in interface I_CmsTextExtractor
Overrides: extractText in class A_CmsTextExtractor

Parameters:
  content - the binary content of the document to extract the text from
  encoding - the encoding to use
Returns: the extracted text
Throws: java.lang.Exception - if the text extraction fails
See Also: I_CmsTextExtractor.extractText(byte[], java.lang.String)
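
To illustrate what extracting text from an RTF document involves, the following is a minimal sketch that uses only the JDK's `javax.swing.text.rtf.RTFEditorKit`. It is not taken from the OpenCms sources and may differ from what `CmsExtractorRtf` does internally. The class name `RtfTextSketch` and the helper `extractPlainText` are hypothetical names introduced for this example. The helper takes no encoding argument because an RTF file declares its character set in its own header (here `\ansi`).

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import javax.swing.text.BadLocationException;
import javax.swing.text.Document;
import javax.swing.text.rtf.RTFEditorKit;

public class RtfTextSketch {

    /**
     * Hypothetical helper (not part of OpenCms): parses the given RTF bytes
     * with the JDK's RTF reader and returns the text without control words.
     */
    public static String extractPlainText(byte[] content)
            throws IOException, BadLocationException {
        RTFEditorKit kit = new RTFEditorKit();
        // RTFEditorKit creates a styled document, which its reader requires.
        Document doc = kit.createDefaultDocument();
        // Parse the RTF stream into the document, starting at offset 0.
        kit.read(new ByteArrayInputStream(content), doc, 0);
        // Return the document's character content only.
        return doc.getText(0, doc.getLength());
    }

    public static void main(String[] args) throws Exception {
        // A tiny RTF document; the space after \ansi ends the control word.
        byte[] rtf = "{\\rtf1\\ansi Hello World}"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(extractPlainText(rtf).trim());
    }
}
```

With OpenCms on the classpath, the documented entry point would be used instead: obtain the extractor with `CmsExtractorRtf.getExtractor()` and call `extractText(content, encoding)` on it, passing `null` as the encoding if it is unknown.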