Skip to content

diskshima/pdf-searcher

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Searcher

Setup

  1. Install Node modules.
    npm i
  2. Run Elasticsearch with Docker.
    docker run -p 9200:9200 -p 9300:9300 -e "discovery.type=single-node" docker.elastic.co/elasticsearch/elasticsearch:6.5.1
  3. Install kuromoji.
    docker exec -it CONTAINER_ID /bin/sh
    sh-4.2# bin/elasticsearch-plugin install analysis-kuromoji    # Inside Docker container
  4. Add document mappings.
    curl -X PUT -H 'Content-type: application/json' http://localhost:9200/documents --data '@es_data/documents_mappings.json'

Building

npm run build

Running

Adding documents

npm run main add PATH_TO_PDF

Searching

npm run main search SEARCH_TERM

Debugging

  1. Replace the main above with debug
    npm debug main add PATH_TO_PDF
  2. Open Chrome and navigate to chrome://inspect/.
  3. Click Open dedicated DevTools for Node. DevTools should automatically connect to the node process.

About

PDF search using TypeScript and Elasticsearch.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors