mirror of
https://github.com/kovidgoyal/calibre.git
synced 2025-07-09 03:04:10 -04:00
Full text search: imporve <ruby> parsing when indexing books
Following the spec of the <ruby> tag, it's better to ignore only the sub-tags <rt>, <rp> and <rtc>, because the root text inside the <ruby> tag is what we want indexing.
This commit is contained in:
parent
25ad85a69c
commit
62916ee574
@ -20,7 +20,7 @@ class SimpleContainer(ContainerBase):
|
||||
tweak_mode = True
|
||||
|
||||
|
||||
skipped_tags = frozenset({'style', 'title', 'script', 'head', 'img', 'svg', 'math', 'ruby'})
|
||||
skipped_tags = frozenset({'style', 'title', 'script', 'head', 'img', 'svg', 'math', 'rt', 'rp', 'rtc'})
|
||||
|
||||
|
||||
def tag_to_text(tag):
|
||||
|
Loading…
x
Reference in New Issue
Block a user