From b5bb81978a855eae57a556cbab2cc25e3b9dcbc1 Mon Sep 17 00:00:00 2001
From: "Thomas R. Koll"
Date: Wed, 19 Jun 2024 08:46:52 +0200
Subject: [PATCH] Extend range of matching tex keywords

It's not uncommon for TeX files to start with % comments, which then
allows the MATLAB matcher to claim a match. Extending the range used to
match \documentclass and \documentstyle, and adding \usepackage, makes
detection work for my files.

On the MATLAB side, the comment matcher has already caused a mismatch
with PDF files (TIKA-3328). As far as I can see, it is the only type
with a matcher for comments, which I think is just ridiculous and
should be removed altogether.
---
 .../main/resources/org/apache/tika/mime/tika-mimetypes.xml | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml b/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
index 279632018d..c4d0f49567 100644
--- a/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
+++ b/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
@@ -5068,12 +5068,13 @@
 <_comment>TeX Source</_comment>
+
+
+
-
-
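
To illustrate the reasoning in the commit message, here is a minimal sketch of offset-range keyword matching. It is not Tika's implementation: the rule tuples, the `detect` helper, the offsets, the priorities, and the MIME type names used here are all assumptions made for the example. It shows how a file that opens with a % comment is claimed by a comment-based matcher when the TeX keyword is only checked at the start of the file, and how widening the search range lets the TeX rule match instead.

```python
from typing import List, Optional, Tuple

# Hypothetical rule shape: (mime type, keyword, first offset, last offset, priority).
# Every value below is illustrative and not taken from tika-mimetypes.xml.
Rule = Tuple[str, bytes, int, int, int]


def keyword_in_range(data: bytes, keyword: bytes, first: int, last: int) -> bool:
    """Return True if keyword starts at any offset in [first, last]."""
    # bytes.find confines the whole match to data[first:stop],
    # so stop is widened by the keyword length.
    return data.find(keyword, first, last + len(keyword)) != -1


def detect(data: bytes, rules: List[Rule]) -> Optional[str]:
    """Return the MIME type of the highest-priority matching rule, if any."""
    hits = [(prio, mime) for mime, kw, first, last, prio in rules
            if keyword_in_range(data, kw, first, last)]
    return max(hits)[1] if hits else None


# A TeX file that starts with a comment line.
SAMPLE = b"% my notes\n\\documentclass{article}\n\\usepackage{amsmath}\n"

# TeX keyword checked only at offset 0; comment matcher for MATLAB at offset 0.
NARROW: List[Rule] = [
    ("application/x-tex", b"\\documentclass", 0, 0, 50),
    ("text/x-matlab", b"%", 0, 0, 20),
]

# Same rules, but the TeX keywords may start anywhere in the first 1 KiB.
WIDE: List[Rule] = [
    ("application/x-tex", b"\\documentclass", 0, 1024, 50),
    ("application/x-tex", b"\\usepackage", 0, 1024, 50),
    ("text/x-matlab", b"%", 0, 0, 20),
]

print(detect(SAMPLE, NARROW))  # the comment matcher wins
print(detect(SAMPLE, WIDE))    # the TeX rule now matches and outranks it
```

In this sketch the comment rule still fires on the sample file, so the fix relies on the TeX rule both matching and having the higher priority. Removing the comment matcher altogether, as the commit message suggests, would make the outcome independent of that ordering.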