mirror of
				https://github.com/paperless-ngx/paperless-ngx.git
				synced 2025-10-26 08:12:34 -04:00 
			
		
		
		
	
		
			
				
	
	
		
			230 lines
		
	
	
		
			7.1 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
			
		
		
	
	
			230 lines
		
	
	
		
			7.1 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
| 
 | |
| ************
 | |
| The REST API
 | |
| ************
 | |
| 
 | |
| 
 | |
| Paperless makes use of the `Django REST Framework`_ standard API interface.
 | |
| It provides a browsable API for most of its endpoints, which you can inspect
 | |
| at ``http://<paperless-host>:<port>/api/``. This also documents most of the
 | |
| available filters and ordering fields.
 | |
| 
 | |
| .. _Django REST Framework: http://django-rest-framework.org/
 | |
| 
 | |
| The API provides 5 main endpoints:
 | |
| 
 | |
| *   ``/api/correspondents/``: Full CRUD support.
 | |
| *   ``/api/document_types/``: Full CRUD support.
 | |
| *   ``/api/documents/``: Full CRUD support, except POSTing new documents. See below.
 | |
| *   ``/api/logs/``: Read-Only.
 | |
| *   ``/api/tags/``: Full CRUD support.
 | |
| 
 | |
| All of these endpoints except for the logging endpoint 
 | |
| allow you to fetch, edit and delete individual objects
 | |
| by appending their primary key to the path, for example ``/api/documents/454/``.
 | |
| 
 | |
| In addition to that, the document endpoint offers these additional actions on
 | |
| individual documents:
 | |
| 
 | |
| *   ``/api/documents/<pk>/download/``: Download the original document.
 | |
| *   ``/api/documents/<pk>/thumb/``: Download the PNG thumbnail of a document.
 | |
| *   ``/api/documents/<pk>/preview/``: Display the original document inline,
 | |
|     without downloading it.
 | |
| 
 | |
| .. hint::
 | |
| 
 | |
|     Paperless used to provide these functionality at ``/fetch/<pk>/preview``,
 | |
|     ``/fetch/<pk>/thumb`` and ``/fetch/<pk>/doc``. Redirects to the new URLs
 | |
|     are in place. However, if you use these old URLs to access documents, you
 | |
|     should update your app or script to use the new URLs.
 | |
| 
 | |
| .. note::
 | |
| 
 | |
|     The document endpoint provides tags, document types and correspondents as
 | |
|     ids in their corresponding fields. These are writeable. Paperless also
 | |
|     offers read-only objects for assigned tags, types and correspondents,
 | |
|     however, these might be removed in the future. As for now, the front end
 | |
|     requires them.
 | |
| 
 | |
| Authorization
 | |
| #############
 | |
| 
 | |
| The REST api provides three different forms of authentication.
 | |
| 
 | |
| 1.  Basic authentication
 | |
| 
 | |
|     Authorize by providing a HTTP header in the form
 | |
|     
 | |
|     .. code::
 | |
| 
 | |
|         Authorization: Basic <credentials>
 | |
|     
 | |
|     where ``credentials`` is a base64-encoded string of ``<username>:<password>``
 | |
| 
 | |
| 2.  Session authentication
 | |
| 
 | |
|     When you're logged into paperless in your browser, you're automatically
 | |
|     logged into the API as well and don't need to provide any authorization
 | |
|     headers.
 | |
| 
 | |
| 3.  Token authentication
 | |
| 
 | |
|     Paperless also offers an endpoint to acquire authentication tokens.
 | |
| 
 | |
|     POST a username and password as a form or json string to ``/api/token/``
 | |
|     and paperless will respond with a token, if the login data is correct.
 | |
|     This token can be used to authenticate other requests with the
 | |
|     following HTTP header:
 | |
| 
 | |
|     .. code::
 | |
| 
 | |
|         Authorization: Token <token>
 | |
|     
 | |
|     Tokens can be managed and revoked in the paperless admin.
 | |
| 
 | |
| Searching for documents
 | |
| #######################
 | |
| 
 | |
| Paperless-ng offers API endpoints for full text search. These are as follows:
 | |
| 
 | |
| ``/api/search/``
 | |
| ================
 | |
| 
 | |
| Get search results based on a query.
 | |
| 
 | |
| Query parameters:
 | |
| 
 | |
| *   ``query``: The query string. See
 | |
|     `here <https://whoosh.readthedocs.io/en/latest/querylang.html>`_
 | |
|     for details on the syntax.
 | |
| *   ``page``: Specify the page you want to retrieve. Each page
 | |
|     contains 10 search results and the first page is ``page=1``, which
 | |
|     is the default if this is omitted.
 | |
| 
 | |
| Result list object returned by the endpoint:
 | |
| 
 | |
| .. code:: json
 | |
| 
 | |
|     {
 | |
|         "count": 1,
 | |
|         "page": 1,
 | |
|         "page_count": 1,
 | |
|         "corrected_query": "",
 | |
|         "results": [
 | |
|             
 | |
|         ]
 | |
|     }
 | |
| 
 | |
| *   ``count``: The approximate total number of results.
 | |
| *   ``page``: The page returned to you. This might be different from
 | |
|     the page you requested, if you requested a page that is behind
 | |
|     the last page. In that case, the last page is returned.
 | |
| *   ``page_count``: The total number of pages.
 | |
| *   ``corrected_query``: Corrected version of the query string. Can be null.
 | |
|     If not null, can be used verbatim to start a new query.
 | |
| *   ``results``: A list of result objects on the current page.
 | |
| 
 | |
| Result object:
 | |
| 
 | |
| .. code:: json
 | |
| 
 | |
|     {
 | |
|         "id": 1,
 | |
|         "highlights": [
 | |
|             
 | |
|         ],
 | |
|         "score": 6.34234,
 | |
|         "rank": 23,
 | |
|         "document": {
 | |
|             
 | |
|         }
 | |
|     }
 | |
| 
 | |
| *   ``id``: the primary key of the found document
 | |
| *   ``highlights``: an object containing parsable highlights for the result.
 | |
|     See below.
 | |
| *   ``score``: The score assigned to the document. A higher score indicates a
 | |
|     better match with the query. Search results are sorted descending by score.
 | |
| *   ``rank``: the position of the document within the entire search results list.
 | |
| *   ``document``: The full json of the document, as returned by
 | |
|     ``/api/documents/<id>/``.
 | |
| 
 | |
| Highlights object:
 | |
| 
 | |
| Highlights are provided as a list of fragments. A fragment is a longer section of
 | |
| text from the original document.
 | |
| Each fragment contains a list of strings, and some of them are marked as a highlight.
 | |
| 
 | |
| .. code:: json
 | |
| 
 | |
|     [
 | |
|         [
 | |
|             {"text": "This is a sample text with a "},
 | |
|             {"text": "highlighted", "term": 0},
 | |
|             {"text": " word."}
 | |
|         ],
 | |
|         [
 | |
|             {"text": "Another", "term": 1},
 | |
|             {"text": " fragment with a highlight."}
 | |
|         ]
 | |
|     ]
 | |
|     
 | |
| 
 | |
| 
 | |
| When ``term`` is present within a string, the word within ``text`` should be highlighted.
 | |
| The term index groups multiple matches together and words with the same index
 | |
| should get identical highlighting.
 | |
| A client may use this example to produce the following output:
 | |
| 
 | |
| ... This is a sample text with a **highlighted** word. ... **Another** fragment with a highlight. ...
 | |
| 
 | |
| ``/api/search/autocomplete/``
 | |
| =============================
 | |
| 
 | |
| Get auto completions for a partial search term.
 | |
| 
 | |
| Query parameters:
 | |
| 
 | |
| *   ``term``: The incomplete term.
 | |
| *   ``limit``: Amount of results. Defaults to 10.
 | |
| 
 | |
| Results returned by the endpoint are ordered by importance of the term in the
 | |
| document index. The first result is the term that has the highest Tf/Idf score
 | |
| in the index.
 | |
| 
 | |
| .. code:: json
 | |
| 
 | |
|     [
 | |
|         "term1",
 | |
|         "term3",
 | |
|         "term6",
 | |
|         "term4"
 | |
|     ]
 | |
| 
 | |
| 
 | |
| .. _api-file_uploads:
 | |
| 
 | |
| POSTing documents
 | |
| #################
 | |
| 
 | |
| The API provides a special endpoint for file uploads:
 | |
| 
 | |
| ``/api/documents/post_document/``
 | |
| 
 | |
| POST a multipart form to this endpoint, where the form field ``document`` contains
 | |
| the document that you want to upload to paperless. The filename is sanitized and
 | |
| then used to store the document in a temporary directory, and the consumer will
 | |
| be instructed to consume the document from there.
 | |
| 
 | |
| The endpoint supports the following optional form fields:
 | |
| 
 | |
| *   ``title``: Specify a title that the consumer should use for the document.
 | |
| *   ``correspondent``: Specify the ID of a correspondent that the consumer should use for the document.
 | |
| *   ``document_type``: Similar to correspondent.
 | |
| *   ``tags``: Similar to correspondent. Specify this multiple times to have multiple tags added
 | |
|     to the document.
 | |
| 
 | |
| The endpoint will immediately return "OK" if the document consumption process
 | |
| was started successfully. No additional status information about the consumption
 | |
| process itself is available, since that happens in a different process.
 |