You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+9-6Lines changed: 9 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,7 @@ It allows you to index your content into the usual Nextcloud database.
6
6
7
7
## Compatibility
8
8
9
-
The extension requires your Nextcloud database to be MySQL (tested) or PostgreSQL (currently untested). SQLite might work as well, but isn't yet implemented.
9
+
The extension requires your Nextcloud database to be MySQL or PostgreSQL.
10
10
11
11
## Status
12
12
@@ -17,21 +17,24 @@ What works:
17
17
* Indexing of text in PDF documents
18
18
* This is done by extracting the text via [Smalot/PdfParser].
19
19
* This app itself does *NOT* do optical chracter recognition (OCR)! If your files don't already contain the extracted text, maybe the [files_fulltextsearch_tesseract] app is for you. I haven't tested it together with this app.
20
-
* MySQL
20
+
* MySQL (tested in CI pipeline and in real world usage)
21
+
* PostgreSQL (tested in CI pipeline)
22
+
* Plainly assumes "english" configuration (which influences stopwords and normalization)
21
23
* Basic searching
22
-
* If the database is MySQL, it uses [Boolean Full-Text Searches], so you can use operators like `+` and `-`, as well as a trailing `*` wildcard
24
+
* If the database is MySQL, it uses [Boolean Full-Text Searches], so you can use operators like `+` and `-`, as well as a trailing `*` wildcard
25
+
* If the database is PostgreSQL, the query is converted using [`websearch_to_tsquery`], so you can use `-` for exclusions and quote text to enforce word groups
* Indexing of Office documents: The upstream [fulltextsearch_elasticsearch] app simply passes the files on to the [Elasticsearch Attachment processor], which in turn uses [Apache Tika] for processing. Since I want to keep this app lean, I don't want to pull in any Java dependencies.
30
-
* "Advanced" features of the full text search framework. There are fields for tags, metatags, subtags, parts, excerpts and whatnot. I have no idea yet what they are used for. The app just stores them on indexing and returns them in search results, but doesn't search those fields.
31
-
* PostgreSQL: Could work, but I haven't tested it. Might need small fixes, and plainly assumes "english" configuration (which influences stopwords and normalization).
35
+
* "Advanced" features of the full text search framework. There are fields for tags, metatags, subtags, parts and whatnot. I have no idea yet what they are used for. The app just stores them on indexing and returns them in search results, but doesn't search those fields.
32
36
* SQLite: Might be implementable, but I haven't spent more time than a quick search for "fulltext search sqlite"
0 commit comments