High Performance Storage System

Incremental Scalability
Based on storage needs and deployment schedules, HPSS scales incrementally by adding computer, network and storage resources. A single HPSS namespace can scale from petabytes of data to exabytes of data, from millions of files to billions of files, and from a few file-creates per second to thousands of file-creates per second.

HPSS File Operations Per Second

Every HPSS file operation requires one or more Db2 database transactions (e.g. an HPSS file create with data requires four Db2 metadata transactions). HPSS services help with planning, provisioning, and configuring Db2 to meet your transaction requirements.
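As a rough planning aid, the relationship above can be turned into simple arithmetic. The sketch below assumes only the figure given in the text (four Db2 transactions per file create with data); the function name and the sample rates are illustrative, and other file operations may cost a different number of transactions.

```python
# Figure taken from the text above: an HPSS file create with data
# requires four Db2 metadata transactions. Other operations may differ.
TRANSACTIONS_PER_FILE_CREATE = 4


def db2_transactions_per_second(file_creates_per_second: float) -> float:
    """Estimate the Db2 transaction rate implied by a file-create rate."""
    if file_creates_per_second < 0:
        raise ValueError("file_creates_per_second must be non-negative")
    return file_creates_per_second * TRANSACTIONS_PER_FILE_CREATE


if __name__ == "__main__":
    # Illustrative rates only, not measurements from any HPSS site.
    for rate in (10, 1_000, 5_000):
        tps = db2_transactions_per_second(rate)
        print(f"{rate} creates/s -> {tps:.0f} Db2 transactions/s")
```

A real sizing exercise would also account for the mix of reads, deletes, and other operations, which this sketch does not model.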

HPSS Backed by Db2

Metadata describing the files in HPSS is managed by Db2 relational databases. Each relational database has tables and indexes, transaction log files, archived transaction log files, and database backup files. The Db2 transaction log files (active and archived) and the Db2 database backup files are together often called the Db2 recovery objects.

Mandatory HPSS Redundancy

All Db2 database files must be protected by RAID storage: (1) Db2 table and index files; (2) Db2 transaction log files; (3) Db2 log archive files; and (4) Db2 backup files. The Db2 transaction log files are further protected by configuring Db2 log mirroring to make a second copy on a separate RAID device. The Db2 log archive files are also mirrored to a separate RAID device. All Db2 recovery files must be protected by your existing site disaster recovery plan, which should call for the HPSS Db2 files to be stored on remote disk or remote tape. Your site's backup serves as the second copy of the Db2 database backup files. It is not acceptable to store the Db2 backups in HPSS.
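The log mirroring and mirrored log archiving described above map onto standard Db2 database configuration parameters: `MIRRORLOGPATH` for the second copy of the active logs, and `LOGARCHMETH1` and `LOGARCHMETH2` for two archive destinations. The following sketch only builds the corresponding `db2 update db cfg` command strings. The database name and all paths are hypothetical placeholders, and the procedure actually used for an HPSS deployment may differ.

```python
def db2_log_protection_commands(
    db_name: str,
    mirror_log_path: str,
    archive_path: str,
    archive_mirror_path: str,
) -> list[str]:
    """Return db2 CLP commands for log mirroring and dual log archiving.

    The commands are returned as strings and are NOT executed here.
    """
    settings = [
        # Second copy of the active transaction logs on a separate RAID device.
        ("MIRRORLOGPATH", mirror_log_path),
        # Primary and secondary destinations for archived transaction logs.
        ("LOGARCHMETH1", f"DISK:{archive_path}"),
        ("LOGARCHMETH2", f"DISK:{archive_mirror_path}"),
    ]
    return [
        f"db2 update db cfg for {db_name} using {name} {value}"
        for name, value in settings
    ]


if __name__ == "__main__":
    # Hypothetical database name and mount points, for illustration only.
    for command in db2_log_protection_commands(
        "EXAMPLEDB",
        "/raid2/db2/mirrorlog",
        "/raid3/db2/archive",
        "/raid4/db2/archive_mirror",
    ):
        print(command)
```

Each path should sit on a different RAID device from the copy it protects; the sketch does not check this.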

Optional HPSS Metadata Redundancy

When Db2 metadata must survive a complete chassis failure, a second local server is provisioned with the RAID-device for the second copy of Db2 recovery objects discussed earlier. The separate chassis can be mounted in a separate rack to protect the HPSS metadata from a rack failure.

Db2 log shipping replicates HPSS Db2 metadata across geographic locations. It protects HPSS metadata from a wider range of failures, including a site failure, and helps reduce the downtime those failures cause.

HPSS Db2 Capacity

Here is an example of Db2 storage for 1 billion HPSS files in a single chassis metadata configuration:
  1. High IOPS storage (at least two RAID devices)
    1. 1 TB RAID: Db2 transaction log files and half of the Db2 tables and indexes
    2. 1 TB RAID: Db2 transaction log mirror files and half of the Db2 tables and indexes
  2. High-bandwidth storage (at least two RAID devices)
    1. 4 TB RAID: Db2 archived transaction log files and half of the Db2 database backup files
    2. 4 TB RAID: Db2 archived transaction log mirror files and half of the Db2 database backup files
  3. Integrating Db2 recovery files into site backup
    1. Db2 recovery files are typically sent to tape using backup software (e.g. Bacula)
    2. 8 TB stores about seven days of Db2 recovery files
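The figures in the example above can be totaled with a few lines of arithmetic. The sketch below uses only the numbers from the list (two 1 TB high-IOPS devices, two 4 TB high-bandwidth devices, 8 TB holding about seven days of recovery files). The `scale_linearly` helper is an assumption added for illustration: the text does not say that Db2 storage grows linearly with file count, so treat its output as a first guess rather than a sizing rule.

```python
EXAMPLE_FILES = 1_000_000_000        # 1 billion HPSS files, from the example
HIGH_IOPS_RAID_TB = [1, 1]           # logs + half of tables/indexes on each
HIGH_BANDWIDTH_RAID_TB = [4, 4]      # archived logs + half of backups on each
RECOVERY_CAPACITY_TB = 8             # "8 TB stores about seven days"
RECOVERY_RETENTION_DAYS = 7


def summarize_example() -> dict[str, float]:
    """Total the example configuration and derive a daily recovery volume."""
    return {
        "high_iops_tb": sum(HIGH_IOPS_RAID_TB),
        "high_bandwidth_tb": sum(HIGH_BANDWIDTH_RAID_TB),
        "recovery_tb_per_day": RECOVERY_CAPACITY_TB / RECOVERY_RETENTION_DAYS,
    }


def scale_linearly(target_files: int) -> dict[str, float]:
    """ASSUMPTION: scale the example in proportion to file count."""
    factor = target_files / EXAMPLE_FILES
    return {name: value * factor for name, value in summarize_example().items()}


if __name__ == "__main__":
    for name, value in summarize_example().items():
        print(f"{name}: {value:.2f}")
```

Read this way, the example implies roughly 1.1 TB of Db2 recovery files per day for a system of this size.
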
SSD vs. HDD for metadata storage

Solid-state drives (SSDs) are cost-effective for the high-IOPS (Db2 tables, indexes, and logs), high-bandwidth (faster backup and restore times), and low bit-error-rate (high integrity) requirements of HPSS Db2. NVMe (Non-Volatile Memory Express) is a host controller interface and storage protocol created to accelerate data transfer to and from SSDs over a computer's high-speed Peripheral Component Interconnect Express (PCIe) bus, and it is available on modern RHEL platforms. SSDs deliver tens of thousands of write IOPS, hundreds of thousands of read IOPS, and transfer rates approaching a gigabyte per second per device. Spinning hard disk drives (HDDs) remain well suited to high-capacity data that is infrequently accessed, but they are no longer ideal for HPSS Db2 metadata.


Come meet with us!
COVID-19 has disrupted the 2021 HPSS User Forum (HUF), and the Karlsruhe Institute of Technology (KIT) in Karlsruhe, Germany, is no longer hosting the event. The 2021 HUF will be hosted online for six days spread across three weeks in October 2021 with no admission cost. This will be a great opportunity to hear from HPSS users, collaboration developers, testers, support staff, and leadership (from IBM and DOE Labs). Please contact us if you are not a customer but would like to attend.

The 2021 International Conference for High Performance Computing, Networking, Storage and Analysis will be in St. Louis, MO, from November 15th through 18th, 2021. As we do each year, we are scheduling and meeting with customers via IBM Single Client Briefings. Please contact your local IBM client executive or contact us to schedule an HPSS Single Client Briefing to meet with the IBM business and technical leaders of HPSS.

HPSS @ STS 2022
The 4th Annual Storage Technology Showcase is in the planning stage, but HPSS expects to support the event in March of 2022. We expect an update in early fall 2021.

HPSS @ MSST 2022
The 37th International Conference on Massive Storage Systems and Technology will be in Santa Clara, California, in May of 2022. Please contact us if you would like to meet with the IBM business and technical leaders of HPSS at Santa Clara University.

What's New?
DOE Announces HPSS Milestone - Todd Heer, Deputy Program Lead, Advanced Simulation and Computing (ASC) Facilities, Operations, and User Support (FOUS), announced that DOE High Performance Storage Systems (HPSS) have eclipsed one exabyte in stored data.

Atos Press Release - Atos boosts Météo-France’s data storage capacity to over 1 exabyte in 2025 to improve numerical modeling and climate predictions.

HPSS 9.2 Release - HPSS 9.2 was released on May 11th, 2021 and introduces eight new features and numerous minor updates.

HPSS 9.1 Release - HPSS 9.1 was released on September 24th, 2020 and introduces a few new features.

HUF 2020 - The HPSS User Forum was hosted virtually at no cost in October 2020.

HPSS 8.3 Release - HPSS 8.3 was released on March 31st, 2020 and introduces one new feature and many minor changes.

Capacity Leader - ECMWF (European Centre for Medium-Range Weather Forecasts) has a single HPSS namespace with over 567 PB spanning over 399 million files.

File-Count Leader - LLNL (Lawrence Livermore National Laboratory) has a single HPSS namespace with over 65 PB spanning 1.540 billion files.

Copyright 1992 - 2021, HPSS Collaboration. All Rights Reserved.