
European Globus Statistics Collector (EGSC) Service Activated

posted 28 Jun 2011, 08:05 by Matthias Hofmann   [ updated 19 Jul 2011, 07:25 ]

Globus tools have been reporting usage data back to a statistics collector in the USA (Argonne) for several years now. The primary reason is to demonstrate to funding agencies that the money invested in Globus development benefits a large community of Globus users. The collected data include:

  • Number of GridFTP users
  • Number of jobs submitted to GRAM
  • Number of running GT4 web services

Sending this information to the USA is legally problematic because the logged data also contain IP addresses, which are considered privacy-critical data in Europe. To resolve this problem, IGE has set up a European Globus Statistics Collector (EGSC), where recorded data is aggregated and forwarded to the US in an anonymized, cumulative form in order to address European privacy concerns and comply with the Safe Harbor Agreement.


Having the EGSC also helps IGE determine whether Globus is indeed widely used in Europe. In all IGE software releases (http://repo-deb.ige-project.eu/ and http://repo-rpm.ige-project.eu/) the EGSC is set as the collector address instead of the US one. Users who are still concerned about their privacy can opt out of the statistics collection at any time by setting an environment variable. For more information see http://globus.org/toolkit/docs/5.0/5.0.4/Usage_Stats.html. IGE's EGSC runs at the address: “statistics.ige-project.eu:4810”.
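As a rough illustration of the opt-out, the following Python sketch prepares an environment for launching a Globus service with usage reporting switched off, and splits the EGSC address into host and port. The variable name GLOBUS_USAGE_OPTOUT is an assumption based on the Globus usage statistics documentation linked above and should be checked there for the installed version; the helper names split_collector and opt_out_environment are hypothetical.

```python
import os

# Collector address quoted in this announcement.
EGSC_ADDRESS = "statistics.ige-project.eu:4810"

# ASSUMPTION: name of the opt-out variable; verify it against the
# Usage_Stats documentation for your Globus Toolkit version.
OPT_OUT_VARIABLE = "GLOBUS_USAGE_OPTOUT"


def split_collector(address):
    """Split a 'host:port' collector address into (host, port)."""
    host, _, port = address.rpartition(":")
    if not host or not port.isdigit():
        raise ValueError(f"expected host:port, got {address!r}")
    return host, int(port)


def opt_out_environment(base_env=None):
    """Return a copy of the environment with the opt-out variable set.

    The caller's environment is not modified; pass the result to
    subprocess.Popen(..., env=...) when starting a Globus service.
    """
    env = dict(os.environ if base_env is None else base_env)
    env[OPT_OUT_VARIABLE] = "1"
    return env


if __name__ == "__main__":
    print(split_collector(EGSC_ADDRESS))
    print(opt_out_environment({"PATH": "/usr/bin"}))
```

Building a copy of the environment rather than changing os.environ keeps the opt-out limited to the processes that are started with it.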


The EGSC service is prepared to collect usage data from different Globus Toolkit (GT) services across Europe and is intended for use by any European deployment of the Globus Toolkit. The EGSC can gather usage data from services including GridFTP, GRAM, gsissh, and RLS, for versions 4 and 5 of the Globus Toolkit. The corresponding GT services are implemented so that they can be configured to send this kind of information.

Data collected by IGE's EGSC will be used for reporting purposes and will provide input to the Quality Assurance (QA) processes of the IGE project. Note that only server usage data is collected, stored in the EGSC database, and used.

A guideline for explicitly configuring different GT5 and GT4 services to send usage statistics to the IGE's EGSC is available here.
