Fifemon

Screen Shot 2016-06-07 at 2.33.05 PMThere’s a dashboard for that” is the unofficial motto of Fifemon, and to that end we are constantly collecting more data and producing new dashboards. Since the last update, we have added nearly 20 new dashboards, including high-level computing summaries, dCache and SAM monitoring, and troubleshooting guides. In addition to these new dashboards, we have made many improvements to the existing dashboards. Read more for a look at upcoming changes, features and upgrades and to learn more about how Fifemon is impacting and how we are working with the scientific computing community outside of Fermilab.

Fifemon is continuing to evolve to provide better monitoring for all users and support personnel. The past month saw the release of versions 3.00 and 3.01, which include many new and revised dashboards, including several high-level “summary” dashboards:

Complete release notes can be found on the About Fifemon dashboard.

Upcoming

Several major changes are currently in testing on Pre-production Fifemon, including moving authentication from Services LDAP to Federated Single Sign-On (SSO). Users will continue to log in with their Services user name and password, except it will be done through the SSO server instead of directly in Grafana. The change to SSO authentication opens up the possibility for more fine-grained authorization, including potentially allowing users to create their own customized dashboards!

The other major change is the upgrade of Grafana to Version 3, which includes numerous UI improvements, the ability to set any dashboard as your own homepage and more. You’re invited to test these changes and provide feedback (via the User Jobs Monitoring (fifemon) Scientific Computing service offering in Service Now) before they get released to production.

Fifemon Outside Fermilab

In spring of 2016, Fifemon was presented at the Open Science Grid all-hands meeting and at HTCondor Week 2016; these presentations were well-received by the community, which was impressed by the amount of information being made available to users and service providers and has expressed an interest in adopting Fifemon for use at other sites. We have begun working with collaborators outside Fermilab so their experience can be used to improve Fifemon for our users, and we hope that by encouraging other sites to establish more comprehensive monitoring, we will be able to get a better view of what resources are available to FIFE experiments at those sites.

For more information, please see the project’s external homepage at https://fifemon.github.io/.

The following items, in no particular order, are some of the things that can appear on a dashboard:

Screen Shot 2016-06-07 at 2.34.22 PM Screen Shot 2016-06-07 at 2.33.54 PM Screen Shot 2016-06-07 at 2.33.45 PM Screen Shot 2016-06-07 at 2.33.39 PM Screen Shot 2016-06-07 at 2.33.32 PM

— Kevin Retzke