Details
-
Type:
User Story
-
Status:
Open
-
Priority:
Major
-
Resolution: Unresolved
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: GRAM
-
Labels:None
-
Ranking:None
Description
ATLAS (and others) want to be able to cluster a set of GRAM2 services in a HA setup to provide greater scalability and reliability.
It would be really nice if there was a well-understood and tested mechanism to provide load balancing and failover in GRAM by having multiple gatekeepers. This is not trivial because submitting a job to one gatekeeper creates state on that gatekeeper. That said, production sites would like to find ways to keep a site running when a single gatekeeper goes down.