mirror of
https://github.com/kovidgoyal/calibre.git
synced 2025-07-09 03:04:10 -04:00
String changes
This commit is contained in:
parent
668e208611
commit
0830a0d9a0
@ -28,7 +28,7 @@ it, a tooltip will appear describing the setting.
|
||||
|
||||
.. contents:: Contents
|
||||
:depth: 1
|
||||
:local:
|
||||
:local:
|
||||
|
||||
.. _conversion-introduction:
|
||||
|
||||
@ -46,9 +46,9 @@ Schematically, it looks like this:
|
||||
:align: center
|
||||
:alt: The conversion pipeline
|
||||
|
||||
The input format is first converted to XHTML by the appropriate *Input Plugin*.
|
||||
The input format is first converted to XHTML by the appropriate *Input plugin*.
|
||||
This HTML is then *transformed*. In the last step, the processed XHTML is converted
|
||||
to the specified output format by the appropriate *output plugin*. The results
|
||||
to the specified output format by the appropriate *Output plugin*. The results
|
||||
of the conversion can vary greatly, based on the input format. Some formats
|
||||
convert much better than others. A list of the best source formats for conversion
|
||||
is available :ref:`here <best-source-formats>`.
|
||||
@ -56,10 +56,10 @@ is available :ref:`here <best-source-formats>`.
|
||||
The transforms that act on the XHTML output are where all the work happens. There are various
|
||||
transforms, for example, to insert book metadata as a page at the start of the book,
|
||||
to detect chapter headings and automatically create a Table of Contents, to proportionally
|
||||
adjust font sizes, et cetera. It is important to remeber that all the transforms act on the
|
||||
XHTML output by the *Input Plugin*, not on the input file itself. So, for example, if you ask calibre
|
||||
adjust font sizes, et cetera. It is important to remember that all the transforms act on the
|
||||
XHTML output by the *Input plugin*, not on the input file itself. So, for example, if you ask calibre
|
||||
to convert an RTF file to EPUB, it will first be converted to XHTML internally,
|
||||
the various transforms will be applied to the XHTML and then the *output plugin* will
|
||||
the various transforms will be applied to the XHTML and then the *Output plugin* will
|
||||
create the EPUB file, automatically generating all metadata, Table of Contents, et cetera.
|
||||
|
||||
You can see this process in action by using the debug option |dbgi|. Just specify the path to
|
||||
@ -72,14 +72,14 @@ The four sub-directories are:
|
||||
========== =============
|
||||
Directory Description
|
||||
========== =============
|
||||
input This contains the HTML output by the Input Plugin. Use this to debug the Input Plugin.
|
||||
parsed The result of pre-processing and converting to XHTML the output from the Input Plugin. Use to debug structure detection.
|
||||
input This contains the HTML output by the Input plugin. Use this to debug the Input plugin.
|
||||
parsed The result of pre-processing and converting to XHTML the output from the Input plugin. Use to debug structure detection.
|
||||
structure Post structure detection, but before CSS flattening and font size conversion. Use to debug font size conversion and CSS transforms.
|
||||
processed Just before the e-book is passed to the output plugin. Use to debug the output plugin.
|
||||
processed Just before the e-book is passed to the Output plugin. Use to debug the Output plugin.
|
||||
========== =============
|
||||
|
||||
If you want to edit the input document a little before having calibre convert it, the best thing to
|
||||
do is edit the files in the :file:`input` sub-directory, then zip it up, and use the ZIP file as the
|
||||
If you want to edit the input document a little before having calibre convert it, the best thing to
|
||||
do is edit the files in the :file:`input` sub-directory, then zip it up, and use the ZIP file as the
|
||||
input format for subsequent conversions. To do this use the :guilabel:`Edit meta information` dialog
|
||||
to add the ZIP file as a format for the book and then, in the top left corner of the conversion dialog,
|
||||
select ZIP as the input format.
|
||||
@ -94,7 +94,7 @@ Look & feel
|
||||
:depth: 1
|
||||
:local:
|
||||
|
||||
This group of options controls various aspects of the look and feel of the converted e-book.
|
||||
This group of options controls various aspects of the look and feel of the converted e-book.
|
||||
|
||||
.. _font-size-rescaling:
|
||||
|
||||
@ -118,19 +118,19 @@ Normally, calibre will automatically choose a base font size appropriate to the
|
||||
have chosen (see :ref:`page-setup`). However, you can override this here in case the default is
|
||||
not suitable for you.
|
||||
|
||||
The :guilabel:`Font size key` option lets you control how non-base font sizes are rescaled.
|
||||
The :guilabel:`Font size key` option lets you control how non-base font sizes are rescaled.
|
||||
The font rescaling algorithm works using a font size key, which is simply a comma-separated
|
||||
list of font sizes. The font size key tells calibre how many "steps" bigger or smaller a given font
|
||||
size should be compared to the base font size. The idea is that there should be a limited number
|
||||
of font sizes in a document. For example, one size for the body text, a couple of sizes for
|
||||
different levels of headings and a couple of sizes for super/sub scripts and footnotes. The
|
||||
font size key allows calibre to compartmentalize the font sizes in the input documents into
|
||||
separate "bins" corresponding to the different logical font sizes.
|
||||
separate "bins" corresponding to the different logical font sizes.
|
||||
|
||||
Let's illustrate with an example.
|
||||
Let's illustrate with an example.
|
||||
Suppose the source document we are converting was produced by someone with excellent
|
||||
eyesight and has a base font size of 8pt. That means the bulk of the text in the document is sized
|
||||
at 8pts, while headings are somewhat larger (say 10 and 12pt) and footnotes somewhat smaller at 6pt.
|
||||
at 8pts, while headings are somewhat larger (say 10 and 12pt) and footnotes somewhat smaller at 6pt.
|
||||
Now if we use the following settings::
|
||||
|
||||
Base font size : 12pt
|
||||
@ -148,7 +148,7 @@ font rescaling wizard, which can be accessed by clicking the little button next
|
||||
:guilabel:`Font size key` setting.
|
||||
|
||||
All the font size rescaling in the conversion can also be disabled here, if you would
|
||||
like to preserve the font sizes in the input document.
|
||||
like to preserve the font sizes in the input document.
|
||||
|
||||
A related setting is :guilabel:`Line height`. Line height controls the vertical height of
|
||||
lines. By default, (a line height of 0), no manipulation of line heights is performed. If
|
||||
@ -159,7 +159,7 @@ the :ref:`Extra CSS <extra-css>`.
|
||||
|
||||
In this section you can also tell calibre to embed any referenced fonts into
|
||||
the book. This will allow the fonts to work on reader devices even if they are
|
||||
not available on the device.
|
||||
not available on the device.
|
||||
|
||||
Text
|
||||
~~~~~~~~
|
||||
@ -183,7 +183,7 @@ document, but it does not always succeed. You can force it to assume a
|
||||
particular character encoding by using this setting. `cp1252` is a common
|
||||
encoding for documents produced using Windows software. You should also read
|
||||
:ref:`char-encoding-faq` for more on encoding issues.
|
||||
|
||||
|
||||
Layout
|
||||
~~~~~~~~~~~~~~~~~~~
|
||||
|
||||
@ -192,7 +192,7 @@ indent. calibre has a couple of options to control this. :guilabel:`Remove spaci
|
||||
forcefully ensure that all paragraphs have no inter paragraph spacing. It also sets the text
|
||||
indent to 1.5em (can be changed) to mark the start of every paragraph.
|
||||
:guilabel:`Insert blank line` does the
|
||||
opposite, guaranteeing that there is exactly one blank line between each pair of paragraphs.
|
||||
opposite, guaranteeing that there is exactly one blank line between each pair of paragraphs.
|
||||
Both these options are very comprehensive, removing spacing, or inserting it for *all* paragraphs
|
||||
(technically <p> and <div> tags). This is so that you can just set the option and be sure that
|
||||
it performs as advertised, irrespective of how messy the input file is. The one exception is
|
||||
@ -227,7 +227,7 @@ priority and so should override most CSS present in the **input document**
|
||||
itself. You can use this setting to fine tune the presentation/layout of your
|
||||
document. For example, if you want all paragraphs of class `endnote` to be
|
||||
right aligned, just add::
|
||||
|
||||
|
||||
.endnote { text-align: right }
|
||||
|
||||
or if you want to change the indentation of all paragraphs::
|
||||
@ -260,7 +260,7 @@ The :guilabel:`Page setup` options are for controlling screen layout, like margi
|
||||
options to setup page margins, which will be used by the output plugin, if the selected output format
|
||||
supports page margins. In addition, you should choose an Input profile and an output profile. Both sets
|
||||
of profiles basically deal with how to interpret measurements in the input/output documents, screen sizes
|
||||
and default font rescaling keys.
|
||||
and default font rescaling keys.
|
||||
|
||||
If you know that the file you are converting was intended to be used on a particular device/software platform,
|
||||
choose the corresponding input profile, otherwise just choose the default input profile. If you know the files
|
||||
@ -273,9 +273,9 @@ The output profile also controls the screen size. This will cause, for example,
|
||||
Heuristic processing
|
||||
---------------------
|
||||
|
||||
Heuristic processing provides a variety of functions which can be used to try and detect and correct
|
||||
common problems in poorly formatted input documents. Use these functions if your input document suffers
|
||||
from poor formatting. Because these functions rely on common patterns, be aware that in some cases an
|
||||
Heuristic processing provides a variety of functions which can be used to try and detect and correct
|
||||
common problems in poorly formatted input documents. Use these functions if your input document suffers
|
||||
from poor formatting. Because these functions rely on common patterns, be aware that in some cases an
|
||||
option may lead to worse results, so use with care. As an example, several of these options will
|
||||
remove all non-breaking-space entities, or may include false positive matches relating to the function.
|
||||
|
||||
@ -284,9 +284,9 @@ remove all non-breaking-space entities, or may include false positive matches re
|
||||
This must be enabled in order for various sub-functions to be applied
|
||||
|
||||
:guilabel:`Unwrap lines`
|
||||
Enabling this option will cause calibre to attempt to detect and correct hard line breaks that exist
|
||||
within a document using punctuation clues and line length. calibre will first attempt to detect whether
|
||||
hard line breaks exist, if they do not appear to exist calibre will not attempt to unwrap lines. The
|
||||
Enabling this option will cause calibre to attempt to detect and correct hard line breaks that exist
|
||||
within a document using punctuation clues and line length. calibre will first attempt to detect whether
|
||||
hard line breaks exist, if they do not appear to exist calibre will not attempt to unwrap lines. The
|
||||
line-unwrap factor can be reduced if you want to 'force' calibre to unwrap lines.
|
||||
|
||||
:guilabel:`Line-unwrap factor`
|
||||
@ -294,61 +294,61 @@ remove all non-breaking-space entities, or may include false positive matches re
|
||||
option is 0.4, that means calibre will remove hard line breaks from the end of lines whose lengths are less
|
||||
than the length of 40% of all lines in the document. If your document only has a few line breaks which need
|
||||
correction, then this value should be reduced to somewhere between 0.1 and 0.2.
|
||||
|
||||
|
||||
:guilabel:`Detect and markup unformatted chapter headings and sub headings`
|
||||
If your document does not have chapter headings and titles formatted differently from the rest of the text,
|
||||
calibre can use this option to attempt detection them and surround them with heading tags. <h2> tags are used
|
||||
for chapter headings; <h3> tags are used for any titles that are detected.
|
||||
|
||||
This function will not create a TOC, but in many cases it will cause calibre's default chapter detection settings
|
||||
calibre can use this option to attempt detection them and surround them with heading tags. <h2> tags are used
|
||||
for chapter headings; <h3> tags are used for any titles that are detected.
|
||||
|
||||
This function will not create a TOC, but in many cases it will cause calibre's default chapter detection settings
|
||||
to correctly detect chapters and build a TOC. Adjust the XPath under Structure detection if a TOC is not automatically
|
||||
created. If there are no other headings used in the document then setting "//h:h2" under Structure detection would
|
||||
be the easiest way to create a TOC for the document.
|
||||
|
||||
|
||||
The inserted headings are not formatted, to apply formatting use the :guilabel:`Extra CSS` option under
|
||||
the Look and Feel conversion settings. For example, to center heading tags, use the following::
|
||||
|
||||
h2, h3 { text-align: center }
|
||||
|
||||
:guilabel:`Renumber sequences of <h1> or <h2> tags`
|
||||
Some publishers format chapter headings using multiple <h1> or <h2> tags sequentially.
|
||||
calibre's default conversion settings will cause such titles to be split into two pieces. This option
|
||||
Some publishers format chapter headings using multiple <h1> or <h2> tags sequentially.
|
||||
calibre's default conversion settings will cause such titles to be split into two pieces. This option
|
||||
will re-number the heading tags to prevent splitting.
|
||||
|
||||
:guilabel:`Delete blank lines between paragraphs`
|
||||
This option will cause calibre to analyze blank lines included within the document. If every paragraph is interleaved
|
||||
with a blank line, then calibre will remove all those blank paragraphs. Sequences of multiple blank lines will be
|
||||
considered scene breaks and retained as a single paragraph. This option differs from the 'Remove Paragraph Spacing'
|
||||
considered scene breaks and retained as a single paragraph. This option differs from the 'Remove Paragraph Spacing'
|
||||
option under 'Look and Feel' in that it actually modifies the HTML content, while the other option modifies the document
|
||||
styles. This option can also remove paragraphs which were inserted using calibre's 'Insert blank line' option.
|
||||
|
||||
:guilabel:`Ensure scene breaks are consistently formatted`
|
||||
With this option calibre will attempt to detect common scene-break markers and ensure that they are center aligned.
|
||||
'Soft' scene break markers, i.e. scene breaks only defined by extra white space, are styled to ensure that they
|
||||
With this option calibre will attempt to detect common scene-break markers and ensure that they are center aligned.
|
||||
'Soft' scene break markers, i.e. scene breaks only defined by extra white space, are styled to ensure that they
|
||||
will not be displayed in conjunction with page breaks.
|
||||
|
||||
:guilabel:`Replace scene breaks`
|
||||
If this option is configured then calibre will replace scene break markers it finds with the replacement text specified by the
|
||||
user. Please note that some ornamental characters may not be supported across all reading devices.
|
||||
|
||||
|
||||
In general you should avoid using HTML tags, calibre will discard any tags and use pre-defined markup. <hr />
|
||||
tags, i.e. horizontal rules, and <img> tags are exceptions. Horizontal rules can optionally be specified with styles, if you
|
||||
choose to add your own style be sure to include the 'width' setting, otherwise the style information will be discarded. Image
|
||||
tags can used, but calibre does not provide the ability to add the image during conversion, this must be done after the fact using
|
||||
tags, i.e. horizontal rules, and <img> tags are exceptions. Horizontal rules can optionally be specified with styles, if you
|
||||
choose to add your own style be sure to include the 'width' setting, otherwise the style information will be discarded. Image
|
||||
tags can used, but calibre does not provide the ability to add the image during conversion, this must be done after the fact using
|
||||
the 'Edit book' feature.
|
||||
|
||||
|
||||
Example image tag (place the image within an 'Images' folder inside the epub after conversion):
|
||||
<img style="width:10%" src="../Images/scenebreak.png" />
|
||||
|
||||
|
||||
Example horizontal rule with styles:
|
||||
<hr style="width:20%;padding-top: 1px;border-top: 2px ridge black;border-bottom: 2px groove black;"/>
|
||||
|
||||
|
||||
:guilabel:`Remove unnecessary hyphens`
|
||||
calibre will analyze all hyphenated content in the document when this option is enabled. The document itself is used
|
||||
as a dictionary for analysis. This allows calibre to accurately remove hyphens for any words in the document in any language,
|
||||
along with made-up and obscure scientific words. The primary drawback is words appearing only a single time in the document
|
||||
will not be changed. Analysis happens in two passes, the first pass analyzes line endings. Lines are only unwrapped if the
|
||||
word exists with or without a hyphen in the document. The second pass analyzes all hyphenated words throughout the document,
|
||||
as a dictionary for analysis. This allows calibre to accurately remove hyphens for any words in the document in any language,
|
||||
along with made-up and obscure scientific words. The primary drawback is words appearing only a single time in the document
|
||||
will not be changed. Analysis happens in two passes, the first pass analyzes line endings. Lines are only unwrapped if the
|
||||
word exists with or without a hyphen in the document. The second pass analyzes all hyphenated words throughout the document,
|
||||
hyphens are removed if the word exists elsewhere in the document without a match.
|
||||
|
||||
:guilabel:`Italicize common words and patterns`
|
||||
@ -364,17 +364,17 @@ remove all non-breaking-space entities, or may include false positive matches re
|
||||
Search & replace
|
||||
---------------------
|
||||
|
||||
These options are useful primarily for conversion of PDF documents or OCR conversions, though they can
|
||||
also be used to fix many document specific problems. As an example, some conversions can leaves behind page
|
||||
headers and footers in the text. These options use regular expressions to try and detect headers, footers,
|
||||
or other arbitrary text and remove or replace them. Remember that they operate on the intermediate XHTML produced
|
||||
by the conversion pipeline. There is a wizard to help you customize the regular expressions for
|
||||
your document. Click the magic wand beside the expression box, and click the 'Test' button after composing
|
||||
These options are useful primarily for conversion of PDF documents or OCR conversions, though they can
|
||||
also be used to fix many document specific problems. As an example, some conversions can leaves behind page
|
||||
headers and footers in the text. These options use regular expressions to try and detect headers, footers,
|
||||
or other arbitrary text and remove or replace them. Remember that they operate on the intermediate XHTML produced
|
||||
by the conversion pipeline. There is a wizard to help you customize the regular expressions for
|
||||
your document. Click the magic wand beside the expression box, and click the 'Test' button after composing
|
||||
your search expression. Successful matches will be highlighted in Yellow.
|
||||
|
||||
The search works by using a Python regular expression. All matched text is simply removed from
|
||||
the document or replaced using the replacement pattern. The replacement pattern is optional, if left blank
|
||||
then text matching the search pattern will be deleted from the document. You can learn more about regular expressions
|
||||
The search works by using a Python regular expression. All matched text is simply removed from
|
||||
the document or replaced using the replacement pattern. The replacement pattern is optional, if left blank
|
||||
then text matching the search pattern will be deleted from the document. You can learn more about regular expressions
|
||||
and their syntax at :ref:`regexptutorial`.
|
||||
|
||||
.. _structure-detection:
|
||||
@ -390,7 +390,7 @@ Chapters and page breaks
|
||||
calibre has two sets of options for :guilabel:`chapter detection` and :guilabel:`inserting page breaks`. This can sometimes be slightly confusing, as by default,
|
||||
calibre will insert page breaks before detected chapters as well as the locations detected by the page breaks option.
|
||||
The reason for this is that there are often location where page breaks should be inserted that are not chapter boundaries.
|
||||
Also, detected chapters can be optionally inserted into the auto generated Table of Contents.
|
||||
Also, detected chapters can be optionally inserted into the auto generated Table of Contents.
|
||||
|
||||
calibre uses *XPath*, a powerful language to allow the user to specify chapter boundaries/page breaks. XPath can seem a little daunting
|
||||
to use at first, fortunately, there is a :ref:`XPath tutorial <xpath-tutorial>` in the User Manual. Remember that Structure detection
|
||||
@ -403,8 +403,8 @@ By default, calibre uses the following expression for chapter detection::
|
||||
//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part\s+', 'i')) or @class = 'chapter']
|
||||
|
||||
This expression is rather complex, because it tries to handle a number of common cases simultaneously. What it means
|
||||
is that calibre will assume chapters start at either `<h1>` or `<h2>` tags that have any of the words
|
||||
`(chapter, book, section or part)` in them or that have the `class="chapter"` attribute.
|
||||
is that calibre will assume chapters start at either `<h1>` or `<h2>` tags that have any of the words
|
||||
`(chapter, book, section or part)` in them or that have the `class="chapter"` attribute.
|
||||
|
||||
A related option is :guilabel:`Chapter mark`, which allows you to control what calibre does when it detects a chapter. By default,
|
||||
it will insert a page break before the chapter. You can have it insert a ruled line instead of, or in addition to the page break.
|
||||
@ -417,7 +417,7 @@ The default setting for detecting page breaks is::
|
||||
which means that calibre will insert page breaks before every `<h1>` and `<h2>` tag by default.
|
||||
|
||||
.. note::
|
||||
|
||||
|
||||
The default expressions may change depending on the input format you are converting.
|
||||
|
||||
Miscellaneous
|
||||
@ -437,7 +437,7 @@ There are a few more options in this section.
|
||||
two covers. This option will simply remove the first image from the source document, thereby
|
||||
ensuring that the converted book has only one cover, the one specified in calibre.
|
||||
|
||||
|
||||
|
||||
Table of Contents
|
||||
------------------
|
||||
|
||||
@ -449,8 +449,8 @@ generate a Table of Contents in the converted e-book, based on the actual conten
|
||||
.. note:: Using these options can be a little challenging to get exactly right.
|
||||
If you prefer creating/editing the Table of Contents by hand, convert to
|
||||
the EPUB or AZW3 formats and select the checkbox at the bottom of the Table
|
||||
of Contents section of the conversion dialog that says
|
||||
:guilabel:`Manually fine-tune the Table of Contents after conversion`.
|
||||
of Contents section of the conversion dialog that says
|
||||
:guilabel:`Manually fine-tune the Table of Contents after conversion`.
|
||||
This will launch the ToC Editor tool after the conversion. It allows you to
|
||||
create entries in the Table of Contents by simply clicking the place in the
|
||||
book where you want the entry to point. You can also use the ToC Editor by
|
||||
@ -460,7 +460,7 @@ generate a Table of Contents in the converted e-book, based on the actual conten
|
||||
|
||||
The first option is :guilabel:`Force use of auto-generated Table of Contents`. By checking this option
|
||||
you can have calibre override any Table of Contents found in the metadata of the input document with the
|
||||
auto generated one.
|
||||
auto generated one.
|
||||
|
||||
The default way that the creation of the auto generated Table of Contents works is that, calibre will first try
|
||||
to add any detected chapters to the generated table of contents. You can learn how to customize the detection of chapters
|
||||
@ -481,7 +481,7 @@ For example, to remove all entries titles "Next" or "Previous" use::
|
||||
Next|Previous
|
||||
|
||||
The :guilabel:`Level 1,2,3 TOC` options allow you to create a sophisticated multi-level Table of Contents.
|
||||
They are XPath expressions that match tags in the intermediate XHTML produced by the conversion pipeline. See the
|
||||
They are XPath expressions that match tags in the intermediate XHTML produced by the conversion pipeline. See the
|
||||
:ref:`conversion-introduction` for how to get access to this XHTML. Also read the :ref:`xpath-tutorial`, to learn
|
||||
how to construct XPath expressions. Next to each option is a button that launches a wizard to help with the creation
|
||||
of basic XPath expressions. The following simple example illustrates how to use these options.
|
||||
@ -525,7 +525,7 @@ This will result in an automatically generated two level Table of Contents that
|
||||
.. warning::
|
||||
|
||||
Not all output formats support a multi level Table of Contents. You should first try with EPUB output. If that
|
||||
works, then try your format of choice.
|
||||
works, then try your format of choice.
|
||||
|
||||
Using images as chapter titles when converting HTML input documents
|
||||
---------------------------------------------------------------------
|
||||
@ -568,7 +568,7 @@ Set the :guilabel:`Level 1 TOC` setting to ``//h:h2/@title``. Then calibre will
|
||||
take the title from the value of the ``title`` attribute on the ``<h2>`` tags,
|
||||
instead of using the text inside the tag. Note the trailing ``/@title`` on the
|
||||
XPath expression, you can use this form to tell calibre to get the text from any
|
||||
attribute you like.
|
||||
attribute you like.
|
||||
|
||||
How options are set/saved for Conversion
|
||||
-------------------------------------------
|
||||
@ -591,11 +591,11 @@ conversion settings.
|
||||
|
||||
When you Bulk Convert a set of books, settings are taken in the following order (last one wins):
|
||||
|
||||
* From the defaults set in Preferences->Conversion
|
||||
* From the defaults set in Preferences->Conversion
|
||||
|
||||
* From the saved conversion settings for each book being converted (if
|
||||
any). This can be turned off by the option in the top left corner of the
|
||||
Bulk Conversion dialog.
|
||||
Bulk Conversion dialog.
|
||||
|
||||
* From the settings set in the Bulk conversion dialog
|
||||
|
||||
@ -606,13 +606,13 @@ override any book specific settings. So you should only bulk convert books
|
||||
together that need similar settings. The exceptions are metadata and input
|
||||
format specific settings. Since the Bulk Conversion dialog does not have
|
||||
settings for these two categories, they will be taken from book specific
|
||||
settings (if any) or the defaults.
|
||||
settings (if any) or the defaults.
|
||||
|
||||
.. note::
|
||||
|
||||
You can see the actual settings used during any conversion by clicking the rotating icon in the lower right corner
|
||||
You can see the actual settings used during any conversion by clicking the rotating icon in the lower right corner
|
||||
and then double clicking the individual conversion job. This will bring up a conversion log
|
||||
that will contain the actual settings used, near the top.
|
||||
that will contain the actual settings used, near the top.
|
||||
|
||||
Format specific tips
|
||||
----------------------
|
||||
@ -655,8 +655,8 @@ OpenOffice and save it in OpenOffice's format .odt. calibre can directly convert
|
||||
Convert TXT documents
|
||||
~~~~~~~~~~~~~~~~~~~~~~
|
||||
|
||||
TXT documents have no well defined way to specify formatting like bold, italics, etc, or document
|
||||
structure like paragraphs, headings, sections and so on, but there are a variety of conventions commonly
|
||||
TXT documents have no well defined way to specify formatting like bold, italics, etc, or document
|
||||
structure like paragraphs, headings, sections and so on, but there are a variety of conventions commonly
|
||||
used. By default calibre attempts automatic detection of the correct formatting and markup based on those
|
||||
conventions.
|
||||
|
||||
@ -668,9 +668,9 @@ TXT input supports a number of options to differentiate how paragraphs are detec
|
||||
|
||||
:guilabel:`Paragraph style: Block`
|
||||
Assumes one or more blank lines are a paragraph boundary::
|
||||
|
||||
|
||||
This is the first.
|
||||
|
||||
|
||||
This is the
|
||||
second paragraph.
|
||||
|
||||
@ -680,7 +680,7 @@ TXT input supports a number of options to differentiate how paragraphs are detec
|
||||
This is the first.
|
||||
This is the second.
|
||||
This is the third.
|
||||
|
||||
|
||||
:guilabel:`Paragraph style: Print`
|
||||
Assumes that every paragraph starts with an indent (either a tab or 2+ spaces). Paragraphs end when
|
||||
the next line that starts with an indent is reached::
|
||||
@ -688,7 +688,7 @@ TXT input supports a number of options to differentiate how paragraphs are detec
|
||||
This is the
|
||||
first.
|
||||
This is the second.
|
||||
|
||||
|
||||
This is the
|
||||
third.
|
||||
|
||||
@ -728,11 +728,11 @@ text in the unwrapping. Increase to include less. You can adjust this value in t
|
||||
|
||||
Also, they often have headers and footers as part of the document that will become included with the text.
|
||||
Use the Search and Replace panel to remove headers and footers to mitigate this issue. If the headers and footers are not
|
||||
removed from the text it can throw off the paragraph unwrapping. To learn how to use the header and footer removal options, read
|
||||
removed from the text it can throw off the paragraph unwrapping. To learn how to use the header and footer removal options, read
|
||||
:ref:`regexptutorial`.
|
||||
|
||||
Some limitations of PDF input are:
|
||||
|
||||
Some limitations of PDF input are:
|
||||
|
||||
* Complex, multi-column, and image based documents are not supported.
|
||||
* Extraction of vector images and tables from within the document is also not supported.
|
||||
* Some PDFs use special glyphs to represent ll or ff or fi, etc. Conversion of these may or may not work depending on just how they are represented internally in the PDF.
|
||||
@ -769,14 +769,14 @@ EPUB advanced formatting demo
|
||||
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
||||
|
||||
Various advanced formatting for EPUB files is demonstrated in this `demo file <https://calibre-ebook.com/downloads/demos/demo.epub>`_.
|
||||
The file was created from hand coded HTML using calibre and is meant to be used as a template for your own EPUB creation efforts.
|
||||
The file was created from hand coded HTML using calibre and is meant to be used as a template for your own EPUB creation efforts.
|
||||
|
||||
The source HTML it was created from is available `demo.zip <https://calibre-ebook.com/downloads/demos/demo.zip>`_. The settings used to create the
|
||||
EPUB from the ZIP file are::
|
||||
|
||||
ebook-convert demo.zip .epub -vv --authors "Kovid Goyal" --language en --level1-toc '//*[@class="title"]' --disable-font-rescaling --page-breaks-before / --no-default-epub-cover
|
||||
|
||||
Note that because this file explores the potential of EPUB, most of the advanced formatting is not going to work on readers less capable than calibre's built-in EPUB viewer.
|
||||
Note that because this file explores the potential of EPUB, most of the advanced formatting is not going to work on readers less capable than calibre's built-in EPUB viewer.
|
||||
|
||||
|
||||
Convert ODT documents
|
||||
@ -824,7 +824,7 @@ section of the conversion dialog. If you are generating a PDF to be used on a
|
||||
specific device, you can turn on the option to use the page size from the
|
||||
:guilabel:`output profile` instead. So if your output profile is set to Kindle, calibre
|
||||
will create a PDF with page size suitable for viewing on the small kindle
|
||||
screen.
|
||||
screen.
|
||||
|
||||
Headers and Footers
|
||||
^^^^^^^^^^^^^^^^^^^^
|
||||
@ -898,4 +898,3 @@ changes you like.
|
||||
|
||||
.calibre-pdf-toc .level-1 td:first-of-type { padding-left: 1.4em }
|
||||
.calibre-pdf-toc .level-2 td:first-of-type { padding-left: 2.8em }
|
||||
|
||||
|
Loading…
x
Reference in New Issue
Block a user