Encrypt and compress read data #63

@morungos

Description

Currently, both the pipeline and the webapp access reads at the record level. That makes fine-grained access easy, but it is not ideal for storage or I/O. We should move to a bucketed/compressed/encrypted model, with (say) packets of 5k reads compressed and encrypted together.

If we keep the packets relatively small, there won't be a huge penalty for accessing a single read. There may even be a performance improvement, since we reduce disk usage, I/O, and index sizes.

This issue affects both the pipeline and the webapp: the pipeline writes the data and the webapp reads it, so Python and Java need to agree on the storage, compression, and encryption formats. See: capsid/capsid-pipeline#8
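As a rough illustration of the packet model, here is a minimal Python sketch (the pipeline side). The function names, the JSON record encoding, and the packet layout are all assumptions for illustration, not an agreed format; encryption (e.g. AES over each compressed blob) is omitted to keep the sketch stdlib-only.

```python
import json
import zlib

PACKET_SIZE = 5000  # reads per packet, per the proposal above


def pack_reads(reads, packet_size=PACKET_SIZE):
    """Group read records into fixed-size packets and compress each packet.

    `reads` is a list of JSON-serializable read records. In the real
    scheme each compressed blob would additionally be encrypted before
    being written to storage.
    """
    packets = []
    for start in range(0, len(reads), packet_size):
        chunk = reads[start:start + packet_size]
        blob = zlib.compress(json.dumps(chunk).encode("utf-8"))
        packets.append(blob)
    return packets


def unpack_read(packets, index, packet_size=PACKET_SIZE):
    """Fetch a single read by global index, decompressing only its packet."""
    packet = packets[index // packet_size]
    chunk = json.loads(zlib.decompress(packet).decode("utf-8"))
    return chunk[index % packet_size]
```

The cost of a single-read lookup is decompressing one 5k-read packet, which is the "no huge penalty" trade-off described above. Using zlib's DEFLATE format also helps cross-language agreement, since Java can read it directly via `java.util.zip.Inflater`.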
