===================================================== Diagnostic Tool Brainstorming Session (EGEE JRA4 F2F) ===================================================== Date: 9th May 2005 Venue: DANTE, Cambridge Glossary ======== MP = Monitoring Point Actions ======= [LK] Investigate whether Mark can have access to the GN2:JRA1 questionaire responses. [ML/RA/AKP] Have a discussion on how results are distributed around the target time in the timeperiod specified by the NMWG specification (need clarification, especially about what happens with asymetric -ve and +ve time tolerences). General Discussion ================== Mark would like to have access to questionaire responses to the GN2:JRA1 document that investigated NOC/GOC's metric requirements. There was a very high response rate to this questionaire. Loukik will look into this. Existing Tools ============== Ratna showed examples of two currently available analysis UIs to performance data: * NLANR Performance Advisor * Traceanal ML said that Advisor was not really suitable for users, and could be thought of as being a front end to the Mediator. Generally Traceanal was thought to be good because of its use of graphs. Query ===== max results is useful to limit the size of the results a user gets back. Such limits make sense when the user submits a query that would return a lot of data. When submitting a query, a user should choose the format in which the output from their results should be displayed to them. Robert showed a web page (from RipeTTM) that displayed a matrix of TCP (?) throughput between RipeTTM boxes. This matrix displays a last and current measurement value. This web page allows a user to drill down into one of the elements of the matrix to get more information. The drill down information showed graphs for the throughput over a set of time periods. Robert said that such overview matricies were very useful. To be able to build such matricies user must able to specify more than one request at a time. For source and destination, a user should be able to select from a drop down list of MP names and IPs. This is valid because a NOC/GOC user will know the MP or will be able to work it out from the name of the MP and the source or destiniation they are investigating. The choice of characteristics the user can query for should be limited to those available from the source destination pair chosen by the user. The client should be able to get available MP names and available characteristics from the Mediator. The statistics that would be useful to users are: min/max, stddev/mean/median, percentile (parameterised by percentile group). Users should enter time periods as one of: start and end dates; or fixed period from a specific date/time (e.g. 24hrs, 7 days, 14 days, 1 month). There was a discussion about the distribution of results about a time period when the maximum number of results is specified. (Reminder: time periods are specified in NMWG as: target time with +ve and -ve time tolerence). No real conclusion was arrived at. Mark said that data should be grouped around the target time, but was not sure what happens when -ve and +ve tolerences are asymeteric. Submission ========== There should be a cancel. It was noted that this would not actually cancel the request on the server (there is no way to do this), it would just free up the client. Displaying Results ================== Graphs should be provided for display. Type of graph depends on measurement. Line graphs used for throughput and histograms for delay data. Analysis ======== Following analysis items were mooted: * metrics from different src/dests on same plot; * correlation plots (e.g. RTT against TCP); * network closeness * thresholds - to display data outside thresholds * alarms RA 11/05/2005