Subject: NPPS Leadership Team
Fwd: Computing and data strategy
Date: Mon, 27 Nov 2023 16:14:23 -0500
Attached is John Hill’s plan sent to the ALDs for Phase 1 input.
NPP will probably have a committee to prepare the response.
Details to come soon.
A few lines for the relevance of including Markus:
As ePIC S&C coordinator, the senior S&C role in the EIC community today and for some time to come, Markus is an ideal representative of that important community. As an S&C leader in JLab’s physics community he also brings an important perspective on their new data facility. As the ePIC representative to the emerging “Computing and Software Joint Institute” for EIC, which has produced friction between the facility organizations and the experiment, he brings an important perspective on experiments and computing facilities working in harmony to serve the science.
I suggest Markus Diefenthaler at JLab, ePIC S&C coordinator
Hello Dmitri –
Simone Campana is also a good name. I thought of him and suggested Ale because Ale has been more engaged with SDCC issues recently and participated in BNL reviews. But Simone is also good.
However, I would suggest nominating at least one non-US candidate – they do bring a different perspective, so suggest picking either Ale or Simone.
Subject: RE: Computing and data strategy

Hi Steve,
Heidi is nice, but too busy, including her coming DPF responsibilities. Paul is an interesting proposal, though I suspect they will prefer a US candidate, including for travel to the meetings. Simone is a good candidate.
Would Heidi Schellman be a good candidate as well?
What about Simone Campana?
Paul Laycock?
> Having an NSF connection is very useful. Just waiting for others to
> comment/suggest. We will need a couple of lines about the persons we are nominating (relevance to the committee job).
> Thanks, Dmitri.
> Kaushik is an IPA at NSF currently and also continues in ATLAS at 25%.
> Since he is NSF and not DOE, I suggested his name.
> Dear Srini,
> All very good suggestions. My understanding is that Kaushik is at NSF now – or is he on leave of absence?
> Let’s hear what others think and then submit a list.
> Thanks, Dmitri.
> Hello Dmitri –
> I would like you to consider the following:
> Alessandro di Girolamo (CERN). He is the person I had suggested
> bringing to BNL on a sabbatical and was also involved in an earlier SDCC review. He just stepped down as the ATLAS computing coordinator.
> Kaushik De (UTA, ATLAS) – he was one of my US ATLAS computing coordinators.
> Peter Elmer (Princeton, CMS) – leading the IRIS-HEP efforts amongst
> many other things. Regular presence at many reviews.
> Shaun McKee (Michigan, ATLAS) – manages all the US ATLAS T2 centers, well experienced with networking.
> Torre of course may suggest some good additional candidates.
> Srini
> Dear All,
> John is asking for names for the external advisory committee. HEP,
> including ATLAS and Belle II, with DUNE ramping up, is ~50% of SDCC
> efforts, so we have to have strong representation on the external
> committee. I suggest we have a couple of ATLAS (CMS is fine) reps, at
> least one from Belle II and at least one from DUNE. Please suggest
> names; going above the numbers above is perfectly reasonable, as John will have
> to take into account affiliations, etc. Remember, we are looking for non-BNL reps. No need, yet, to check if they are available or not. If you can send names to me by early next week, it will be great.
> Thanks, Dmitri.
> At today’s science council meeting, John Hill discussed
> the plan for the “Computing and data strategy”. See attached slides.
> The last attempt a few years ago did not go anywhere, but this time,
> the lab management wants to take ownership of this plan.
> Please take a look at the “plan for the plan”.
> An external advisory committee is foreseen. It would be
> good to give our recommendations on who can serve on the committee.
> Best,
> Hong.
Dear colleagues:
I am sharing an email below from John Hill on the Computing and Data Strategic Plan. We will discuss the NPP response to the request this afternoon. Thanks.
Best, Haiyan
Subject: Computing and Data Strategic Plan
Haiyan, Jim, Martin, David and Kerstin,
As you know I am running a process to look at the way we do data and computer science here at BNL in order to make sure that we are positioned for success.
The first step is to perform a self-assessment (Phase I). Phase II will consist of us answering some guiding questions for ourselves, so that we may then write a coherent strategic plan (Phase III). I expect the whole process to take something like a year – this is a lot of work, but it is vital that we get this right. Appended below is an outline of Phases I, II and III. I also expect to bring in outside experts to give us advice along the way.
I am writing today to ask each of you to answer the following questions for your organization (i.e., Phase I):
- What do you need from a data science perspective from BNL?
i. Compute
ii. Storage
iii. Infrastructure (software, hardware, services, support, OSS policy,…)
iv. AI/ML
v. Workforce development in data science
- Training
- Recruitment
- Retention
- Were there opportunities you lost out on because we did not have computing capabilities (expertise or access to hardware)?
- What are the key areas where data science will be needed by you in the next 5-10 years - what is needed to be competitive?
- Please carry out a SWOT analysis for computer science within your organization
- In the area of scientific data and computing, for each of ITD, CSI and SDCC document:
i. What is working
ii. What are the barriers to working more effectively with that organization
iii. What would be most helpful to you in enabling your science
Please answer these questions in a maximum of ten pages. For those of you with large-scale facilities within your organizations, or large-scale science (e.g. ATLAS), you will want to provide two sets of answers to the above questions: one for the small-scale science and one for the large facility-driven science. This could be two separate documents, or delineated within one with a higher page count.
Please get these to me by Friday, February 2nd, 2024. If you can get them to me earlier, that would be very helpful, but I understand all that you have going on.
If you have any questions now, or as you get to writing, please reach out. Happy to discuss at any time.
Thank you very much,
Phase I: Self-assessment
- Charge each of ITD, SDCC and CSI with documenting:
- What is your mission as you currently see it
- SWOT for your organization in meeting that mission
- Benchmark your organization against equivalents at other national labs (size, mission, capabilities, impact)
- Barriers in achieving your mission (external and internal to BNL)
- Charge Science Directorates: Facilities and Core Depts Separately
- What do you need from a data science perspective from BNL?
- Compute
- Storage
- Infrastructure (software, hardware, services, support, OSS policy,…)
- Workforce development in data science
- Training
- Recruitment
- Retention
- Were there opportunities you lost out on because we did not have computing capabilities (expertise or access to hardware)?
- What are the key areas where data science will be needed by you in the next 5-10 years - what is needed to be competitive?
- Please carry out a SWOT analysis for computer science within your organization
- In the area of scientific data and computing, for each of ITD, CSI and SDCC document:
- What is working
- What are the barriers to working more effectively with that organization
- What would be most helpful to you in enabling your science
Phase II: Determine guiding parameters for strategic plan
- BDISC Brookhaven Data Infrastructure Steering Committee (reconstitute with DDST as Chair)
Charge it with answering the following questions as input for the strategy
- What is our vision for data at BNL to enable our mission and BNL vision
- Building on the analysis in 1) and 2), determine areas of computer science R&D where BNL is leading, or poised to take a lead.
- What are ASCR's priorities in the next 5 years? What about other Directorates within SC? And elsewhere in the federal government?
- Where can BNL have the biggest impact?
- What are the possible growth areas in Computer Science (if different from ASCR priorities) and how can we prepare for them?
- Identify role of BNL in
- Developing AI/ML
- Utilizing AI/ML
- Identify unexploited synergies across existing lab efforts in computing.
- What security posture is required for different use cases?
- Determine areas of CS R&D required to support BNL mission-driven computing (development of new technologies or frameworks, as opposed to development required to adopt/adapt existing ones)
- Identify unexploited synergies/engagements with Stony Brook CS (Institute for Advanced Computational Science) or other partners
- How should BNL interact with the ASCR facilities including HPDF and LCFs?
- Analysis of on-prem vs off-prem for various use-cases
- Identify areas of ITD, CSI and SDCC which need improvement in order to support the lab mission successfully
Phase III: Write Strategic plan
- With the input from phases I and II, develop strategy doc to achieve vision. Document should include
- Vision for data science at BNL
- Strategy to support small experiments and theory
- Strategy to support large scale facilities
- Strategy for computer science at BNL
- Cross-cut strategies
- Role for HPDF and other LCFs
- Role of cloud and on-prem compute and storage
- Cyber security
- AI/ML strategy
- Workforce development
- Infrastructure development
- Compute
- Storage
- Network
- Policy
- ..
- Identify the organizational structure that would best position the lab for achieving its data strategy.
- Timeline with milestones to execute strategy
