Skip to content

Bioinformatics

The NWGC has the computational infrastructure needed to meet the data management, data storage, and data sharing demands associated with large-scale data production. The NWGC has developed and implemented an automated pipeline for processing data from our collection of Illumina, PacBio, and Oxford Nanopore sequencers.  

We have a dedicated high-performance computing system that includes analysis cluster servers (~3,000 total CPU cores), storage (2.4PB usable), and backup system (4 LTO-8, 12 LTO-7 and 4 LTO-6 drives, 12/6TB/2.5TB native capacity respectively) that are interconnected with high-speed, low-latency 10GbE, 40GbE, and 100GbE allowing data to be quickly moved through the analysis pipeline. We store quality-associated sequences, mapping data, variant calls, etc. on our cluster servers for up to six months. Our dedicated tape library ensures the required throughput for data archiving, backups, and disaster recovery.