Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flow datastore exports #3161

Closed
muloem opened this issue Aug 14, 2019 · 5 comments
Closed

Flow datastore exports #3161

muloem opened this issue Aug 14, 2019 · 5 comments

Comments

@muloem
Copy link
Member

muloem commented Aug 14, 2019

No description provided.

muloem added a commit that referenced this issue Aug 14, 2019
@dlebrero
Copy link
Contributor

dlebrero commented Sep 3, 2019

We finally opted for an external cronjob running in k8s that uses the service accounts of akvo-flow-server-config.

Code is at the Akvo-platforms repo. See https://github.com/akvo/akvo-platform/issues/172.

The cronjob is running but right now just backs up the data from akvoflow-dev2. This is so that we can monitor its behaviour for a couple of weeks before enabling it for all instances.

The only other piece of work to be done is the check how long it takes the job to run once all the instances are enabled, as right now one instance backup is not stared until the previous backup finished, which we guess it can take a long time.

Making the backup async is easy (https://cloud.google.com/datastore/docs/export-import-entities#async-flag) but we are not sure what that will mean from an error reporting point of view.

@janagombitova
Copy link
Contributor

@dlebrero thank you for the work and the update. Two follow up questions:

  • how often will we back up the data?
  • when do we aim to enable the cronjob to production?

@dlebrero
Copy link
Contributor

dlebrero commented Sep 4, 2019

Once a week, due to the $$$ cost, which I think Emmanuel has already talked about with you.
In a couple of weeks, once it has run a couple of times and Emmanuel and Oriol have review the code.
I will run a one off backup for all instances today (hopefully)

@dlebrero
Copy link
Contributor

dlebrero commented Sep 4, 2019

Backed up all instances.

Found two issues, that I will let the Flow team handle:

  1. Backup fails due to too many kinds in instance #3169, which I fixed with a workaround. Needs a proper fix.
  2. backup not working for akvoflow-44 #3171

@janagombitova
Copy link
Contributor

As the work here is done, I am closing the issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants