diff --git a/src/calibre/manual/conversion.rst b/src/calibre/manual/conversion.rst index e7c09a57a5..2bc5687262 100644 --- a/src/calibre/manual/conversion.rst +++ b/src/calibre/manual/conversion.rst @@ -331,6 +331,22 @@ remove all non-breaking-space entities. Some documents use a convention of defining text indents using non-breaking space entities. When this option is enabled |app| will attempt to detect this sort of formatting and convert them to a 3% text indent using css. +.. search-replace: + +Search & Replace +--------------------- + +These options are useful primarily for conversion of PDF documents. Often, the conversion leaves +behind page headers and footers in the text. These options use regular expressions to try and detect +the headers and footers and remove them. Remember that they operate on the intermediate XHTML produced +by the conversion pipeline. There is also a wizard to help you customize the regular expressions for +your document. These options can also be used for generic search and replace of any content by additionally +specifying a replacement expression. + +The search works by using a python regular expression. All matched text is simply removed from +the document or replaced using the replacement pattern. You can learn more about regular expressions and +their syntax at http://docs.python.org/library/re.html. + .. _structure-detection: Structure Detection @@ -374,21 +390,6 @@ which means that |app| will insert page breaks before every `