multi-reindex

Reindex multiple elasticsearch indices, save your progress, mutate your data in-flight.

WARNING:

This is 99% done, but is still rough around the edges. However, it will be 1.0.0 within a week or so.

Requirements

Currently chillastic relies on redis to store the current state of the transfer and any pending jobs.

If you have docker installed, the easiest way to set up redis is to run the following:

docker run -it -d -p 6379:6379 --name redis redis:3

Otherwise, point the application at another redis install using the REDIS_HOST environment variable.

Steps

npm install --save chillastic

Pass in the configuration:

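// After `npm install`, require the package by name (use './index' only from inside this repository).
const Chillastic = require('chillastic');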

let configuration = {
  source:      {
    host:       'localhost:9200',
    apiVersion: '1.4'
  },
  destination: {
    host:       'localhost:9201',
    apiVersion: '2.2'
  },
  redis:       {
    hostname: 'localhost',
    port:     6379
  },
  concurrency: 3,
  indices:     '*',
  data:        '*'
};

Chillastic(configuration);
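
When using the programmatic configuration, the redis section can be derived from the environment instead of being hard-coded. The following is a minimal sketch, not part of chillastic: the helper name redisConfigFromEnv is hypothetical, REDIS_HOST is the variable mentioned under Requirements, and REDIS_PORT is an assumption added purely for illustration.

```javascript
// Hypothetical helper: build the `redis` section of the configuration
// from environment variables. REDIS_PORT is an assumed variable name.
function redisConfigFromEnv(env) {
  const port = Number.parseInt(env.REDIS_PORT, 10);
  return {
    hostname: env.REDIS_HOST || 'localhost',
    port:     Number.isNaN(port) ? 6379 : port
  };
}

// Falls back to localhost:6379 when neither variable is set.
console.log(redisConfigFromEnv(process.env));
```

The result can be assigned to the redis key of the configuration object shown above.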

Examples

To run these examples, either change the source and destination in the command, or if you have docker installed, run ./scripts/docker_dependencies.sh.

To create some test data, you can use the elasticsearch data generator found here: https://github.com/oliver006/elasticsearch-test-data

Generate the test data using that generator with:

python es_test_data.py --es_url=http://localhost:9200 --index-name=testdata-2015-05-21 --count=100000
python es_test_data.py --es_url=http://localhost:9200 --index-name=testdata-2015-03-01 --count=100000
python es_test_data.py --es_url=http://localhost:9200 --index-name=testdata-2015-05-05 --count=100000
python es_test_data.py --es_url=http://localhost:9200 --index-name=not-testdata --count=100000

Transfer all index configurations, and all indices and types from one elasticsearch to another.

node index.js --source localhost:9200 --dest localhost:9201 -i '*' -d '*'

Transfer just index data (not mappings, settings, aliases, etc) for indices starting with testing.

node index.js --source localhost:9200 --dest localhost:9201 -d 'testing*'

Transfer just index data, and use the indexDate mutator.

node index.js --source localhost:9200 --dest localhost:9201 -d '*' --mutators './examples/mutators/indexDate.js'
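
The -i and -d arguments in the commands above take wildcard patterns such as 'testing*'. How chillastic resolves these patterns internally is not shown in this README; the sketch below only illustrates the matching idea with hypothetical helpers (wildcardToRegExp, filterIndices), assuming '*' stands for any run of characters.

```javascript
// Hypothetical helpers, not part of chillastic: convert a simple wildcard
// pattern into an anchored RegExp, escaping every other special character.
function wildcardToRegExp(pattern) {
  const escaped = pattern
    .split('*')
    .map((part) => part.replace(/[.+?^${}()|[\]\\]/g, '\\$&'))
    .join('.*');
  return new RegExp(`^${escaped}$`);
}

// Keep only the index names that match the pattern.
function filterIndices(names, pattern) {
  const matcher = wildcardToRegExp(pattern);
  return names.filter((name) => matcher.test(name));
}

const names = ['testdata-2015-05-21', 'testdata-2015-03-01', 'not-testdata'];
console.log(filterIndices(names, 'testdata*'));
// prints [ 'testdata-2015-05-21', 'testdata-2015-03-01' ]
```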

How it works

So what does all this do?

Operations are performed in the following order. None of these steps is mandatory; each one runs only if its corresponding arguments are provided.

Index configurations are run through any relevant mutators and transferred.
Templates are run through any relevant mutators and transferred.
Indices for data transfer are found based on the names provided, then filtered and sorted.
All types for each of those indices are found and filtered as needed.
A list of pending jobs is created in redis. Each job consists of an index and a type, prioritized based on the sorting function provided.
The requested number of workers is created (between 1 and the number of CPUs).
Each worker removes a job from the queue, then adds it to the completed set once it has finished with no errors.

If you are forced to stop and restart the process, as long as the completed jobs are left in redis they will not be reprocessed.
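
The queue and resume behaviour described above can be sketched in memory. This is an illustration under stated assumptions, not chillastic's implementation: a Set stands in for the completed set kept in redis, a single synchronous loop stands in for the workers, and the names jobId, buildJobs and runJobs are hypothetical.

```javascript
// In-memory stand-in for the redis-backed queue; all names are hypothetical.
function jobId(job) {
  return `${job.index}/${job.type}`;
}

// One job per index/type pair, ordered by the supplied comparator.
function buildJobs(typesByIndex, compareJobs) {
  const jobs = [];
  for (const [index, types] of Object.entries(typesByIndex)) {
    for (const type of types) {
      jobs.push({ index, type });
    }
  }
  return jobs.sort(compareJobs);
}

// Process pending jobs, skipping any already recorded as completed.
function runJobs(jobs, completed, transfer) {
  const pending = jobs.filter((job) => !completed.has(jobId(job)));
  while (pending.length > 0) {
    const job = pending.shift();
    transfer(job);             // an exception here leaves the job uncompleted
    completed.add(jobId(job)); // recorded only after an error-free transfer
  }
  return completed;
}

const jobs = buildJobs(
  { 'testdata-2015-03-01': ['logs'], 'testdata-2015-05-21': ['logs', 'users'] },
  (a, b) => b.index.localeCompare(a.index) // later index names first
);

// Pretend one job finished before a restart.
const completed = new Set(['testdata-2015-05-21/logs']);
const transferred = [];
runJobs(jobs, completed, (job) => transferred.push(jobId(job)));
console.log(transferred);
// prints [ 'testdata-2015-05-21/users', 'testdata-2015-03-01/logs' ]
```

Because the completed set is consulted before any work starts, restarting with the same set skips the job that already finished.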

Error Handling

Any errors while transferring the index configurations or templates will halt the process.

If an es_rejected_execution_exception is detected during data transfer, the affected records are retried after a random sleep, because this error only indicates that the target is overwhelmed by input. Any other type of error during data transfer fails the entire job, which is re-added to the end of the job queue to be tried again later.
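
The retry policy above can be sketched as follows. This is a hedged illustration rather than chillastic's actual code: the function names, the job.batches field, the 10 to 50 ms sleep window, and the assumption that a failed write surfaces as an error object with a type property are all hypothetical.

```javascript
// Hypothetical sketch of the retry policy; names and error shape are assumptions.
const REJECTED = 'es_rejected_execution_exception';

// Rejections are retried; anything else fails the whole job.
function classifyFailure(error) {
  return Boolean(error) && error.type === REJECTED ? 'retry' : 'fail';
}

// Random sleep length in [minMs, maxMs); `random` is injectable for testing.
function randomDelayMs(minMs, maxMs, random = Math.random) {
  return Math.floor(minMs + random() * (maxMs - minMs));
}

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Retry one batch of records for as long as the target reports rejections.
async function writeWithRetry(batch, writeBatch) {
  for (;;) {
    try {
      return await writeBatch(batch);
    } catch (error) {
      if (classifyFailure(error) !== 'retry') {
        throw error;
      }
      await sleep(randomDelayMs(10, 50));
    }
  }
}

// Run a job; on any non-retryable error, push it to the back of the queue.
async function transferJob(job, writeBatch, queue) {
  try {
    for (const batch of job.batches) {
      await writeWithRetry(batch, writeBatch);
    }
    return 'completed';
  } catch (error) {
    queue.push(job);
    return 'requeued';
  }
}
```

Here writeBatch is a caller-supplied async function that writes one batch to the destination and throws on failure.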

Future features

Dry run mode
Allow data transfers to be sub-divided based on range queries to a specific field (e.g. split by a data field)
Automatic scaling based on search response time from the source, and errors during writes to destination to maximize throughput
Allow multiple transfer nodes to be brought up and coordinated via redis.
Allow use of flat file for coordination instead of redis
