High Performance Storage System

Incremental Scalability
Based on storage needs and deployment schedules, HPSS scales incrementally by adding computer, network and storage resources. A single HPSS namespace can scale from petabytes of data to exabytes of data, from millions of files to billions of files, and from a few file-creates per second to thousands of file-creates per second.
HPSS File Operations Per Second

Every HPSS file operation requires one or more Db2 database transactions (e.g. an HPSS file create with data requires four Db2 metadata transactions). HPSS services help with planning, provisioning, and configuring Db2 to meet your transaction requirements.
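As a rough planning aid, the relationship between file operations and Db2 transactions can be sketched as simple arithmetic. The factor of four transactions per file create with data comes from the text above. The function name and the `headroom` parameter are illustrative assumptions, not part of HPSS or Db2.

```python
import math

# From the text: an HPSS file create with data requires four Db2 metadata transactions.
TRANSACTIONS_PER_CREATE_WITH_DATA = 4


def required_db2_tps(file_creates_per_second: float,
                     transactions_per_create: int = TRANSACTIONS_PER_CREATE_WITH_DATA,
                     headroom: float = 0.0) -> int:
    """Estimate the Db2 transactions per second needed for a target create rate.

    `headroom` is a hypothetical safety margin (0.25 means 25% extra capacity).
    """
    if file_creates_per_second < 0 or headroom < 0:
        raise ValueError("rates and headroom must be non-negative")
    return math.ceil(file_creates_per_second * transactions_per_create * (1.0 + headroom))


if __name__ == "__main__":
    # 1,000 file creates per second with data, no margin.
    print(required_db2_tps(1000))
    # 2,500 file creates per second with a 25% margin.
    print(required_db2_tps(2500, headroom=0.25))
```

Other file operations need their own transaction counts, which the text does not list, so this sketch covers only the file-create case.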

HPSS Backed by Db2

Metadata describing the files in HPSS is managed by Db2 relational databases. Each relational database has tables and indexes, transaction log files, archived transaction log files, and database backup files. Together, the Db2 transaction log files (active and archived) and the Db2 database backup files are often called the Db2 recovery objects.

Mandatory HPSS Redundancy

All Db2 database files must be protected by RAID storage: (1) Db2 table and index files; (2) Db2 transaction log files; (3) Db2 log archive files; and (4) Db2 backup files. The Db2 transaction log files are further protected by configuring Db2 log mirroring to make a second copy on a separate RAID device. The Db2 log archive files are also mirrored to a separate RAID device. All Db2 recovery files must also be protected under your existing site disaster recovery plan, which should call for the HPSS Db2 files to be stored on remote disk or remote tape. Your site's backup serves as the second copy of the Db2 database backup files. It is not acceptable to store the Db2 backups in HPSS.
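The placement rules above can be expressed as a small checklist in code. The following is a minimal sketch, assuming a hypothetical inventory format: the class names, device labels, and `check_layout` function are invented for illustration and are not an HPSS or Db2 tool. It checks only two things from the text: every file class sits on RAID, and each mirror copy lives on a different device from its primary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Placement:
    file_class: str       # hypothetical label, e.g. "transaction_logs"
    device: str           # hypothetical device label, e.g. "raid_a"
    raid_protected: bool


# The four file classes the text says must be on RAID storage.
REQUIRED = ("tables_and_indexes", "transaction_logs", "log_archives", "backups")
# Classes the text says are mirrored to a separate RAID device.
MIRRORED = {"transaction_logs": "transaction_log_mirror",
            "log_archives": "log_archive_mirror"}


def check_layout(placements):
    """Return a list of human-readable problems; an empty list means no rule was violated."""
    problems = []
    by_class = {}
    for p in placements:
        by_class.setdefault(p.file_class, []).append(p)
        if not p.raid_protected:
            problems.append(f"{p.file_class} on {p.device} is not RAID-protected")
    for cls in REQUIRED:
        if cls not in by_class:
            problems.append(f"missing placement for {cls}")
    for primary, mirror in MIRRORED.items():
        primary_devices = {p.device for p in by_class.get(primary, [])}
        mirror_devices = {p.device for p in by_class.get(mirror, [])}
        if not mirror_devices:
            problems.append(f"{primary} has no mirror copy")
        elif primary_devices & mirror_devices:
            problems.append(f"{mirror} shares a device with {primary}")
    return problems


# An illustrative four-device layout (device names are made up).
EXAMPLE_LAYOUT = [
    Placement("transaction_logs", "raid_a", True),
    Placement("tables_and_indexes", "raid_a", True),
    Placement("transaction_log_mirror", "raid_b", True),
    Placement("tables_and_indexes", "raid_b", True),
    Placement("log_archives", "raid_c", True),
    Placement("backups", "raid_c", True),
    Placement("log_archive_mirror", "raid_d", True),
    Placement("backups", "raid_d", True),
]

if __name__ == "__main__":
    print(check_layout(EXAMPLE_LAYOUT) or "layout satisfies the checked rules")
```

The sketch does not model the off-site disaster recovery copy or the rule against storing Db2 backups in HPSS; those remain procedural requirements.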

Optional HPSS Metadata Redundancy

When Db2 metadata must survive a complete chassis failure, a second local server is provisioned to host the RAID device that holds the second copy of the Db2 recovery objects discussed earlier. The separate chassis can be mounted in a separate rack to protect the HPSS metadata from a rack failure.

Db2 log shipping is used to protect HPSS metadata from a wider range of failures, including a site failure, and to help reduce downtime caused by these failures. It does so by replicating HPSS Db2 metadata across geographic locations.

HPSS Db2 Capacity

Here is an example of Db2 storage for 1 billion HPSS files in a single chassis metadata configuration:
  1. High IOPS storage (at least two RAID devices)
    1. 1 TB RAID: Db2 transaction log files and half of the Db2 tables and indexes
    2. 1 TB RAID: Db2 transaction log mirror files and half of the Db2 tables and indexes
  2. High-bandwidth storage (at least two RAID devices)
    1. 4 TB RAID: Db2 archived transaction log files and half of the Db2 database backup files
    2. 4 TB RAID: Db2 archived transaction log mirror files and half of the Db2 database backup files
  3. Integrating Db2 recovery files into site backup
    1. Db2 recovery files are typically sent to tape using backup software (e.g. Bacula)
    2. 8 TB stores about seven days of Db2 recovery files
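The figures in this example can be tallied with a few lines of arithmetic. This is a sketch only: it assumes decimal terabytes (1 TB = 10^12 bytes), and the per-file and per-day numbers are derived here from the example above rather than stated by HPSS. The per-file figure covers everything on the high-IOPS devices (logs, log mirrors, tables and indexes), so it is an upper bound on table and index space per file.

```python
# Figures taken from the example above; names are illustrative.
FILES = 1_000_000_000
HIGH_IOPS_TB = (1, 1)          # two RAID devices
HIGH_BANDWIDTH_TB = (4, 4)     # two RAID devices
RECOVERY_TB = 8                # "8 TB stores about seven days of Db2 recovery files"
RECOVERY_DAYS = 7
BYTES_PER_TB = 10**12          # assumption: decimal terabytes


def summarize_example():
    """Return derived totals for the 1-billion-file example."""
    high_iops_tb = sum(HIGH_IOPS_TB)
    high_bandwidth_tb = sum(HIGH_BANDWIDTH_TB)
    return {
        "total_tb": high_iops_tb + high_bandwidth_tb,
        # High-IOPS bytes provisioned per HPSS file (integer division).
        "high_iops_bytes_per_file": high_iops_tb * BYTES_PER_TB // FILES,
        # Recovery-file capacity consumed per day, implied by the 8 TB / 7 day figure.
        "recovery_tb_per_day": round(RECOVERY_TB / RECOVERY_DAYS, 2),
    }


if __name__ == "__main__":
    for name, value in summarize_example().items():
        print(f"{name}: {value}")
```

Scaling these ratios to other file counts is an extrapolation; actual sizing depends on workload and should be confirmed during planning.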

SSD vs. HDD for Metadata Storage

Solid-state drives (SSDs) are cost-effective for the high IOPS (Db2 tables, indexes and logs), high bandwidth (for faster backup/restore times), and low bit-error-rate (high integrity) requirements of HPSS Db2. NVMe (Non-Volatile Memory Express) is a host controller interface and storage protocol created to accelerate the transfer of data to and from SSDs over a computer's high-speed Peripheral Component Interconnect Express (PCIe) bus, and it is available on modern RHEL platforms. SSDs deliver tens of thousands of write IOPS, hundreds of thousands of read IOPS, and transfer rates approaching a gigabyte per second per device. Spinning hard disk drives (HDDs) remain a good fit for high-capacity data that is not frequently accessed, but they are no longer ideal for HPSS Db2 metadata.


Copyright 1992 - 2020, HPSS Collaboration. All Rights Reserved.