1. Overview

HPC3 is one of UCI’s shared-computing cluster that expands upon the condo-style model of GreenPlanet and the now-retired HPC clusters. Condo clusters have one model of expanding capacity - users purchase physical hardware.

HPC3 supports capacity expansion via:

Purchase hardware
Recharge allocations: purchase allocations of cycles by the core-hour
No-cost allocations: granting of cycles from UCI-purchased hardware. UCI has purchased both CPU and GPU nodes hardware as part of HPC3. Annual funds are used to add to this resource enabling RCIC to provide no-cost resource allocations to a larger fraction of the UCI research community.

The most significant change from a condo-only is that owner-dedicated queues no longer exist. Instead, researchers receive an allocation of available core-hours that are deposited into a bank account [1].

A bank account:

Has owners, and they can designate others who can charge to their account.
Can have multiple authorized users and allocated jobs are charged against the account.
For node owners, the theoretical capacity of their hardware is converted into available core hours and deposited into the bank account in addition to any granted or purchased hours.
Represents computing units (CPU-hour or GPU-hour) and logically functions as a prepaid account.
Can be filled in a variety of ways

Granted:

UCI core funds have purchased nodes that create the capacity for granted hours, or no-cost allocations.

Converted:

from hardware purchase. Each node can deliver core-hours = cores * 8760 hours/year theoretical maximum. For node owners, 95% of this maximum is deposited annually into an account for their use on any resource in the cluster.

Purchased:

recharge allocations. UCI researchers can purchase prepaid time to fill/augment their banks.

Quick links to most common information requests

HPC3 provides a rich collection of domain-specific Environment modules software packages.
What are Resource Allocations in detail.
How does Reallocation work.
How to use Recharge to buy core-hours or hardware.
Hardware Specs.

1.1. Goals & Policies

The HPC3 planning committee crafted policy guidelines to meet the following goals:

Enable access to a larger compute/analysis system than could be reasonably afforded on an individual lab basis.
Enable access to specialized nodes (e.g. large memory, GPU).
Foster a growing community across UCI to use scalable computing (HPC and HTC) for their scientific research and teaching.
Provide a well-managed software environment that underpins a reproducible scientific instrument. Fit seamlessly into the progression of:

desktop → lab cluster → campus → national -> commercial cloud
Enable construction of more-secure research environments.

HPC3 policies are needed to primarily address issues such:

How is contention for acquiring and using resources addressed?
How does one balance high utilization against wait times for jobs to start ?
What are principles to enable and support long-running jobs?
Are there ways to support priority boosting for jobs with specific deadlines (e.g. grants and publications)?
How can groups that contributed resources be ensured their fair share?

The questions above have no single right answer and this means that:

Any policy employed on HPC3 must be tuned to balance the wide range of needs specifically for the UCI research community.
Any implemented policy must be fluid and flexible.

Please see the following documents for more in-depth information.

A Vision For Research CyberInfrastructure at UCI: provides the rationale for what Research Cyberinfrastructure should be and some new features that need to be implemented.
Policy/Usage Document: provides a draft document started in 2018. The RCIC began the process of crafting this document that could provide the framework for creating HPC3 and the principles by which it would run. The HPC3 subcommittee of the RCIC advisory committee edited and refined the initial version. Going forward, this document will continually be updated to reflect adjustments and refinements.
HPC3 Policy: provides an executive summary.

1. Overview

1.1. Goals & Policies

1.2. Fair Sharing