Reprocessing: Meeting of 18-July-2005: 9:30-10:30CDT ESNet video conference
Our meeting number on the ESNet is 823073776 (82d0repro).
Instructions to dial into a video conference via phone.
Agenda
- News
- Features for new JIM release cut (Andrew Baranovski)
- Status of remote site Certification
- Status of production
- JIM Deployment and Remote Setup of remaining sites.
- AOB.
Minutes
Participants
Tibor Kurca, Gabriele Garzoglio, Parag Mhashilkar (Lyon), Yann Coadou (Vancouver),
Jae Yu (FNAL, Outback), Frederic Villeneuve-Seguier (London),
Andrew Baranovski, Mike Diesburg, Daniel Wicke (FNAL, WH7X)
Topics
- News
- Marco Verzocchi found 32 files that have a wrong event count. In many cases we still have the log files and understood what happened.
The new release cut of JIM will include a fix that verifies the number of events before storing a merged file.
- 8 additional file were corrupted, can't be read.
- Recovery will be done with p17.05.xx probably at the d0farm.
- Status of new JIM release cut (Andrew Baranovski)
- All code is in CVS and realease (since Friday).
- SamGrid team is testing new features on the SAMGrid-farm (from today).
- Deployment to d0farm expected on Wednesday. Additional testing of new features from Friday or Monday Lyon.
- Status of production
- WestGrid: Home partition got filled. Recovery ongoing. No special problems.
Prestageing: New prestaged files are no longer pinned, which makes downloads way faster and removed interference with production.
Unpinning files with sam uncache file.
- Lyon: No (new) problems. New datasets required (within the next week).
- SAR-UTA: Limitation to 5 merge jobs resolved (though not understood). New datasets required (within the next week).
- SAR-Oscer: ??
- CMS-Farm: ??
- Wisconsin: ??
- Prague: ??
- Manchester: ??
- Imperial College: Extended size of durable location. Now starting to run with new setup.
- GridKa: ??
- Status of remote site Certification
- Status of production certification.
- Status of Merge Certification of Sites
- DØFarm: done.
- WestGrid: done.
- Lyon: done.
- SAR: UTA: done
- SAR: Oscer: done
- GridKa: done.
- Wisconsin: done.
- Prague: done.
- GridKa: done. But expected to run remaining two dataset.
- SAR-SPRACE: ongoing
- JIM Deployment and Remote Setup of remaining sites.
- RAL: Admins could be convinced to install dedicated gatekeeper, which is now being installed.
- Lancs: Site installed, now testing setup.
- AOB:
- Joe will produce a comparision of the various versions of recocert we used
and put the comparision to rou web page.
Action Items:
Next Meeting
25-July-2005.
Mike Diesburg, Daniel Wicke, 16-July-2005. Last Change 18-July-2005.
Diesburg@fnal.gov,
Wicke@fnal.gov