noced by any clients or any other components. The watchdog service restarts the server and the server
comes back up in about two seconds.
JBoss Cluster
The JBoss applicaon server runs on all nodes except dedicated database nodes in the Junos Space
cluster. The nodes form a single all-acve logical cluster and the load-balancer server (described
previously) distributes the load across all the nodes. Even if one or more of the JBoss servers in the
cluster fails, the applicaon logic sll connues to be accessible from the surviving nodes. JBoss servers
on all nodes are started with the same conguraon and use UDP mulcast to detect each other and
form a single cluster. JBoss also uses UDP mulcast for session replicaon and caching services across
all the nodes.
NOTE: The JBoss server does not run on Fault Monitoring and Performance Monitoring (FMPM)
nodes and hosted virtual machines.
When the JBoss server on a node goes down, other nodes in the JBoss cluster detect this change and
automacally recongure themselves to remove the failed node from the cluster. The me taken by
other cluster members to detect a failed JBoss server depends on whether the JBoss server process
crashed abnormally or is unresponsive. In the former case, cluster members detect the failure
immediately (around two seconds) because their TCP connecons to the crashed JBoss server are
closed by the operang system. In the laer case, cluster members detect the failure in about 52
seconds. If a JBoss server crashes, the JBoss server is restarted automacally by the watchdog service
(jmp-watchdog) running on the node. When the JBoss server comes back up, the JBoss server is
automacally discovered by other cluster members and added to the cluster. The JBoss server then
synchronizes its cache from the other nodes in the cluster. The typical restart me for the JBoss server
is two to ve minutes, but it can take more me depending on the number of applicaons installed, the
number of devices being managed, the number of DMI schema versions installed, and so forth.
One JBoss server in the cluster always acts as the primary of the cluster. The main purpose of the
primary designaon is to host services that are deployed as cluster-wide singletons (HA singletons)—for
example, services that must be deployed on only one server in the cluster at any me. Junos Space uses
a several services of this type, including the Job Poller service, which provides a single mer for
scheduling jobs across the cluster, and the Distributed Resource Manager (DRM) service, which
monitors and manages the nodes in the cluster. These services are deployed only on the JBoss server
that is designated as the primary.
NOTE: This does not mean that the primary does not host other services. Non-cluster singleton
services are also hosted on the primary server. Junos Space is congured such that the rst JBoss
13