Register for the workshop on the KeystoneREN website
June 17, 2025 | |
---|---|
8:15—8:45 | continental breakfast & networking |
8:45—9:00 | Opening Introductions/Welcome |
9:00—9:15 | Jefferson University’s Journey to Globus |
9:15—10:15 |
We will provide an overview of the Globus platform, and demonstrate several data management features. Serving as an introductory session suitable for all current and prospective users, we’ll use the Globus web app to show data transfer and sharing, and connect your laptop/desktop to Globus. We will also introduce the Globus Command Line Interface. This introduction will provide important context for subsequent sessions. |
10:15—10:45 | break |
10:45—11:45 |
The Science DMZ is a scalable network design pattern that facilitates an optimized way for exchanging research and education data transfers. This architectural paradigm is a portion of the network, built at or near the campus or laboratory's local network perimeter, that is designed such that the equipment, configuration, and security policies are optimized for high-performance scientific applications rather than for general-purpose business systems or “enterprise” computing. This talk will give the background of the approach, and cite examples that can be implemented by workshop participants. |
11:45—13:00 | lunch (provided) |
13:00—13:30 |
The Science DMZ architecture optimizes the network path to support high-performance scientific applications. A core component of this approach relies on purpose-built and dedicated computer systems that support the function of wide area data transfer. Data Transfer Nodes (DTNs) are servers built with high-quality components and configured specifically for wide area data transfer. The DTN has access to local storage, and runs the software tools designed for high-speed data transfer to remote systems. This talk will give the background of DTNs, show ways they can be integrated into the network, and show effective build and test strategies. |
13:30—14:15 |
We will present an overview of Globus services for automating research computing and data management tasks, to accelerate research process throughput. This session is aimed at researchers who wish to automate repetitive data management tasks (such as backup and data distribution to collaborators), as well as anyone working with instruments (cryoEM, next-gen sequencers, fMRI, etc.) who wishes to streamline data egress, downstream analysis, and sharing at scale. The material in this session will serve as an introduction to the more advanced concepts that will be covered in detail on day two of the workshop. |
14:15—14:45 | break |
14:45—16:00 |
We will review the Globus Connect Server architecture and deployment model, and describe how system administrators can create a Globus endpoint to access on-premises and cloud-hosted storage systems. You will experiment with installing Globus Connect Server, and configuring a number of common options on the endpoint. We will also demonstrate how to monitor and manage user activity, and options for optimizing file transfer performance. |
16:00—16:30 |
We will present various use cases for applying the Globus data sharing capability, and discuss use of service accounts for automating data access and transfer tasks. |
June 18, 2025 | |
8:15—8:45 | continental breakfast & networking |
8:45—9:45 |
Open discussion, facilitated by the Jefferson team, to identify examples of research data management and computation and common patterns that emerge, as well as opportunities for applying the technologies discussed at the workshop. |
9:45—10:15 |
With the use cases above as context, we will dive deeper into the Globus automation platform and describe how common instrument-based scenarios may be streamlined. We will examine the various components of a Globus flow that takes data from the point of capture on an instrument through to distribution of analyzed data to collaborators, and even broader publication of result data. We will start with a simple series of tasks and progressively work through more complex activities that incorporate computation as well as data description, publication and discovery. In the course of automating these more advanced scenarios we will describe and demonstrate the Globus Compute, Globus Search and Globus Flows services. |
10:15—10:45 | break |
10:45—12:00 |
This session—and the remainder of the day—will be spent defining various automation-enabling services in more detail and progressively augmenting the solution. We will engage in lots of hands-on exercises to familiarize you with many new capabilities and how they may be leveraged using diverse storage and compute resources. |
12:00—13:00 | lunch (provided) |
13:00—13:45 |
Regional Science DMZ Architecture and the PA Science DMZ
|
13:45—14:30 |
Measuring performance of networks is a critical part of ensuring proper operation, and debugging anomalous behavior. perfSONAR is an infrastructure for network performance monitoring, making it easier to solve end-to-end performance problems on paths crossing several networks. It contains a set of services delivering performance measurements in a federated environment. This talk will give a brief history of perfSONAR, suggest ways it can be integrated into network environments, and discuss ways it can be leveraged to solve problems and ensure proper operational performance. |
14:30—15:00 | break |
15:00—15:45 |
This session will cover topics of interest to system administrators (such as managing multi-DTN endpoints, mapping user identities, and using custom domains for data access), as well as other advanced topics of interest to the audience (these will be identified throughout the course of the preceding workshop sessions). |
15:45—16:30 |
We will be available for 1:1 discussions and consultation on specific use cases. Bring us your toughest questions! |