HADSS is short for High Availability Distributed Storage System.
HADSS can be used as the backend of an Object Storage Service.
High Availability in HADSS means:
- No Single Point of Failure.
- Remains responsive even when clusters scale up/down.
HADSS can answer basic requests even when the entire Index Layer is down.
HADSS consists of 3 layers:
- Gateway Layer
- Index Layer
- Storage Layer
Each layer contains nodes across different regions, data centers, or racks.
The Gateway Layer handles users' requests.
The Gateway Layer wraps different kinds of storage services into RPCs to the Storage Layer and Index Layer, e.g. the business logic for an Object Storage Service usually lives in the Gateway Layer.
The best practice in production is to place Load Balancers in front of Gateway Nodes.
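For illustration, a Gateway Node's object PUT handling might look roughly like the sketch below. The interfaces and names (IndexClient, StorageClient, ObjectGateway) are assumptions for this sketch, not the actual HADSS API; the placement decision is a simple placeholder.

```go
// Hypothetical sketch of a Gateway Node wrapping an Object Storage PUT into
// RPCs to the Index and Storage layers. All names here are assumptions.
package gateway

import "context"

// IndexClient provides the cached view of the Storage Layer (assumed interface).
type IndexClient interface {
	NodeStatus(ctx context.Context) (NodeStatus, error)
}

// StorageClient sends data to a Storage Node (assumed interface).
type StorageClient interface {
	Put(ctx context.Context, key string, data []byte) error
}

// NodeStatus is the Index Layer's view of the Storage Nodes (assumed shape).
type NodeStatus struct {
	Version int64
	Nodes   []string
}

// ObjectGateway holds the business logic of the Object Storage Service.
type ObjectGateway struct {
	index   IndexClient
	storage map[string]StorageClient // addressed by Storage Node name
}

// PutObject picks a Storage Node from the cached NodeStatus and forwards the data.
func (g *ObjectGateway) PutObject(ctx context.Context, key string, data []byte) error {
	status, err := g.index.NodeStatus(ctx)
	if err != nil {
		return err
	}
	// Placeholder placement decision; a real gateway would use consistent hashing.
	node := status.Nodes[int(hash(key))%len(status.Nodes)]
	return g.storage[node].Put(ctx, key, data)
}

// hash is a trivial string hash used only for this sketch.
func hash(s string) uint32 {
	var h uint32
	for i := 0; i < len(s); i++ {
		h = h*31 + uint32(s[i])
	}
	return h
}
```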
The Index Layer monitors the Storage Layer's status and performs recovery (see the sketch after this list), e.g.:
- Monitor when the Storage Node cluster scales up/down
- Monitor the health of Storage Nodes
- Provide Storage Nodes' information to other layers
- Detect and recover data corruption (hard disks sometimes corrupt files)
- Detect and recover from node failures
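A minimal sketch of what heartbeat-based health monitoring could look like in the Index Layer; the types and fields (NodeInfo, Monitor) are illustrative assumptions, not the real HADSS code.

```go
// Hypothetical sketch of heartbeat-based health monitoring in the Index Layer.
// The struct and field names are assumptions for illustration only.
package index

import (
	"sync"
	"time"
)

// NodeInfo is what a Storage Node reports in each heartbeat (assumed shape).
type NodeInfo struct {
	Name     string
	Group    string
	UsedPct  float64
	LastSeen time.Time
}

// Monitor keeps the latest heartbeat per Storage Node and flags dead nodes.
type Monitor struct {
	mu      sync.Mutex
	nodes   map[string]NodeInfo
	timeout time.Duration
}

func NewMonitor(timeout time.Duration) *Monitor {
	return &Monitor{nodes: make(map[string]NodeInfo), timeout: timeout}
}

// Heartbeat records a report from a Storage Node.
func (m *Monitor) Heartbeat(info NodeInfo) {
	m.mu.Lock()
	defer m.mu.Unlock()
	info.LastSeen = time.Now()
	m.nodes[info.Name] = info
}

// DeadNodes returns nodes that missed the heartbeat deadline and therefore
// need recovery (e.g. re-replication of their data).
func (m *Monitor) DeadNodes() []NodeInfo {
	m.mu.Lock()
	defer m.mu.Unlock()
	var dead []NodeInfo
	for _, n := range m.nodes {
		if time.Since(n.LastSeen) > m.timeout {
			dead = append(dead, n)
		}
	}
	return dead
}
```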
The Storage Layer stores the data. It receives (Handler, ConsistencyPolicy, NodeStatusVersion (to be implemented)) to store a file. It can auto-balance storage usage between Storage Nodes based on NodeStatus.
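As a rough illustration of that store request, a Storage Node's entry point might look like this; the exact types are assumptions, and NodeStatusVersion is still to be implemented as noted above.

```go
// Hypothetical sketch of the Storage Layer's store entry point, based on the
// (Handler, ConsistencyPolicy, NodeStatusVersion) tuple described above.
// Names and types are assumptions for illustration.
package storage

import "context"

type ConsistencyPolicy int

const (
	// QuorumWrite succeeds once a majority of replicas acknowledge.
	QuorumWrite ConsistencyPolicy = iota
	// WriteAll succeeds only when every replica acknowledges.
	WriteAll
)

// StoreRequest mirrors the tuple the Storage Layer receives for a file.
type StoreRequest struct {
	Handler           string            // identifies where the file lives
	Policy            ConsistencyPolicy // how many replicas must ack
	NodeStatusVersion int64             // to be implemented, per the text above
	Data              []byte
}

// Store would persist the data locally and replicate it to the node group
// according to the requested consistency policy (body intentionally omitted).
func Store(ctx context.Context, req StoreRequest) error {
	// 1. Write locally.
	// 2. Replicate to peers in the node group.
	// 3. Return once the policy is satisfied (e.g. quorum reached).
	return nil
}
```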
The system's sequence diagram for storing a file via the Object Storage Gateway (a quorum-write sketch follows the diagram).
sequenceDiagram
autonumber
actor User
participant GatewayNode
participant StorageNode
participant StorageNode..N
Note right of User: may be behind LoadBalancers
User->>GatewayNode: Sends a file.
activate GatewayNode
loop periodically
GatewayNode->IndexLayer: Syncs the StorageClusters' nodestatus, which contains all Storage Nodes' information.
end
GatewayNode->>GatewayNode: Calculates metadata and determines which StorageNode stores the data.
Note over GatewayNode: GatewayNode stores the metadata and the file.
Note left of StorageNode: If StorageNode is not actually the leader of the node group,
Note left of StorageNode: it will forward the request to the real leader.
GatewayNode-)StorageNode: Puts data (metadata) async.
activate StorageNode
StorageNode-)StorageNode..N: Copy data
activate StorageNode..N
Note over StorageNode, StorageNode..N: Quorum write.
StorageNode..N-->>StorageNode: Copy complete.
deactivate StorageNode..N
StorageNode-->>GatewayNode: Data stored.
deactivate StorageNode
GatewayNode-->>User: File is stored.
deactivate GatewayNode
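A minimal sketch of the quorum write shown above, assuming the leader fans copies out to its node group and acknowledges once a majority (including itself) has the data; the names (Replica, quorumWrite) are illustrative, not the actual HADSS code.

```go
// Hypothetical sketch of the quorum write from the diagram above: the leader
// StorageNode copies data to its peers and acknowledges the GatewayNode once
// a majority of the group (leader + peers) has confirmed.
package storage

import (
	"context"
	"errors"
)

// Replica abstracts a peer Storage Node in the same node group.
type Replica interface {
	Copy(ctx context.Context, key string, data []byte) error
}

// quorumWrite writes locally first, then copies to peers, returning nil once
// a majority of the whole group has the data.
func quorumWrite(ctx context.Context, key string, data []byte, writeLocal func([]byte) error, peers []Replica) error {
	if err := writeLocal(data); err != nil {
		return err
	}
	need := (len(peers)+1)/2 + 1 // majority of the whole group
	acks := 1                    // the leader's own write counts

	results := make(chan error, len(peers))
	for _, p := range peers {
		go func(r Replica) { results <- r.Copy(ctx, key, data) }(p)
	}
	for i := 0; i < len(peers) && acks < need; i++ {
		if err := <-results; err == nil {
			acks++
		}
	}
	if acks >= need {
		return nil // quorum reached; remaining copies continue in the background
	}
	return errors.New("quorum not reached")
}
```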
The system's sequence diagram for fetching a file via the Object Storage Gateway (a consistent-hashing sketch follows the diagram).
sequenceDiagram
autonumber
actor User
User->>GatewayNode: Sends the file's identifier.
Note over User, GatewayNode: LoadBalancers may sit in front of GatewayNodes.
activate GatewayNode
loop periodically
GatewayNode->IndexLayer: Syncs the StorageCluster's states.
end
GatewayNode->>GatewayNode: Calculates the metadata's position.
Note over GatewayNode: This calculation uses Consistent Hashing.
GatewayNode->>GatewayNode: Calculates all data stripes' positions.
Note over GatewayNode: Quorum reads ensure the GatewayNode reads the newest content.
par Collecting data stripes.
GatewayNode->StorageNode0..N: Collects stripe 0 as in steps 4 and 5.
and
GatewayNode->StorageNode0..N: Collects stripe N as in steps 4 and 5.
end
GatewayNode-->>User: Returns the file.
deactivate GatewayNode
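The Consistent Hashing step noted in the diagram could be implemented roughly like the hash-ring lookup below; the virtual-node layout and CRC32 hash are assumptions, not necessarily what HADSS uses.

```go
// Hypothetical sketch of the Consistent Hashing lookup the GatewayNode uses
// to find the metadata's (and stripes') position. Ring layout, virtual-node
// count, and hash choice are assumptions.
package gateway

import (
	"hash/crc32"
	"sort"
	"strconv"
)

// Ring maps keys to Storage Nodes via consistent hashing with virtual nodes.
type Ring struct {
	hashes []uint32          // sorted virtual-node positions on the ring
	owner  map[uint32]string // position -> Storage Node name
}

// NewRing places vnodes virtual nodes per Storage Node on the ring.
func NewRing(nodes []string, vnodes int) *Ring {
	r := &Ring{owner: make(map[uint32]string)}
	for _, n := range nodes {
		for i := 0; i < vnodes; i++ {
			h := crc32.ChecksumIEEE([]byte(n + "#" + strconv.Itoa(i)))
			r.hashes = append(r.hashes, h)
			r.owner[h] = n
		}
	}
	sort.Slice(r.hashes, func(i, j int) bool { return r.hashes[i] < r.hashes[j] })
	return r
}

// Locate returns the Storage Node responsible for the given key: the first
// virtual node clockwise from the key's hash.
func (r *Ring) Locate(key string) string {
	h := crc32.ChecksumIEEE([]byte(key))
	i := sort.Search(len(r.hashes), func(i int) bool { return r.hashes[i] >= h })
	if i == len(r.hashes) {
		i = 0 // wrap around the ring
	}
	return r.owner[r.hashes[i]]
}
```

With virtual nodes, adding or removing a Storage Node only moves the keys adjacent to its positions on the ring, which keeps re-balancing traffic small.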
The system's sequence diagram for clusters scaling up/down (a nodestatus-versioning sketch follows the diagram).
sequenceDiagram
autonumber
participant IndexLayer
participant StorageNode..N
StorageNode..N->IndexLayer: Heartbeat from node1, a new StorageNode joining
StorageNode..N->IndexLayer: Heartbeat from node2, a new StorageNode joining
StorageNode..N->IndexLayer: Heartbeat from node3, a new StorageNode joining
Note right of IndexLayer: IndexLayer collects healthy StorageNodes
IndexLayer->>IndexLayer: Calculates new nodestatus and data distribution.
IndexLayer->>StorageNode..N: Asks node1,2,3 to join group1 and re-distribute data.
StorageNode..N-->>IndexLayer: node1,2,3 Ready
IndexLayer->>IndexLayer: Puts the new data distribution into an uncommitted version of nodestatus.
StorageNode..N->IndexLayer: Heartbeat from node1 (leader of group1): almost full, needs data re-distribution
StorageNode..N->IndexLayer: Heartbeat from node4, a new StorageNode joining
StorageNode..N->IndexLayer: Heartbeat from node5, a new StorageNode joining
StorageNode..N->IndexLayer: Heartbeat from node6, a new StorageNode joining
Note right of IndexLayer: IndexLayer collects healthy StorageNodes
IndexLayer->>IndexLayer: Calculates new nodestatus and data distribution.
IndexLayer->>StorageNode..N: Asks group1 to shrink its data range and node4,5,6 to join group2 and extend its data range.
StorageNode..N-->>IndexLayer: node1,2,3,4,5,6 Ready
IndexLayer->>IndexLayer: Puts the new data distribution into an uncommitted version of nodestatus.
loop
IndexLayer->>IndexLayer: When the last stable nodestatus expires, replaces it with the uncommitted version.
end
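A minimal sketch of the versioned nodestatus handling shown above, assuming a stage-then-promote design where the uncommitted version replaces the stable one once it expires; the names and TTL mechanism are assumptions.

```go
// Hypothetical sketch of versioned nodestatus handling in the Index Layer,
// mirroring the diagram above: new distributions are staged uncommitted and
// promoted when the stable version expires. Field names are assumptions.
package index

import (
	"sync"
	"time"
)

type NodeStatus struct {
	Version      int64
	Distribution map[string][]string // group -> member Storage Nodes
	ExpiresAt    time.Time
}

type StatusStore struct {
	mu          sync.Mutex
	stable      NodeStatus
	uncommitted *NodeStatus
}

// Stage records a new data distribution as the uncommitted version.
func (s *StatusStore) Stage(dist map[string][]string, ttl time.Duration) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.uncommitted = &NodeStatus{
		Version:      s.stable.Version + 1,
		Distribution: dist,
		ExpiresAt:    time.Now().Add(ttl),
	}
}

// PromoteIfExpired replaces the stable nodestatus with the uncommitted one
// once the stable version's lease has run out.
func (s *StatusStore) PromoteIfExpired() {
	s.mu.Lock()
	defer s.mu.Unlock()
	if s.uncommitted != nil && time.Now().After(s.stable.ExpiresAt) {
		s.stable = *s.uncommitted
		s.uncommitted = nil
	}
}
```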
Storage Node clusters scaling up or down doesn't matter by itself; the Storage Nodes' data re-balancing is what matters.
Why: if adding or losing nodes doesn't change the data distribution, nothing else has to happen. Only data re-balancing matters, because it changes the data distribution.
Execute run.sh to test-run the whole system.
StorageConnector has integration tests in connector_test.go to verify the correctness of the interface implementation.
StorageConnector has unit tests in connector_test.go to verify the correct behaviour.
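For illustration only, a unit test in connector_test.go could take roughly this shape; the StorageConnector interface and the in-memory fake below are assumptions for the sketch, not the actual definitions.

```go
// Hypothetical shape of a StorageConnector unit test. The interface and the
// in-memory fake are assumptions, not the real HADSS connector definitions.
package storage

import (
	"bytes"
	"errors"
	"testing"
)

// StorageConnector as assumed for this example.
type StorageConnector interface {
	Put(key string, data []byte) error
	Get(key string) ([]byte, error)
}

// memConnector is an in-memory fake used only by the test.
type memConnector struct{ m map[string][]byte }

func (c *memConnector) Put(key string, data []byte) error { c.m[key] = data; return nil }
func (c *memConnector) Get(key string) ([]byte, error) {
	d, ok := c.m[key]
	if !ok {
		return nil, errors.New("not found")
	}
	return d, nil
}

func TestPutGetRoundTrip(t *testing.T) {
	var c StorageConnector = &memConnector{m: map[string][]byte{}}
	want := []byte("hello hadss")

	if err := c.Put("key1", want); err != nil {
		t.Fatalf("Put failed: %v", err)
	}
	got, err := c.Get("key1")
	if err != nil {
		t.Fatalf("Get failed: %v", err)
	}
	if !bytes.Equal(got, want) {
		t.Fatalf("Get returned %q, want %q", got, want)
	}
}
```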