Research Automation
The automation of data handling has become increasingly important for today’s researcher. Globus addresses this need in a variety of ways. With Globus researchers can:
- Schedule recurring transfers in the Globus web application
- Set up a simple, single step flow, or multi-step data flows at scale through Globus Flows
- Use the Command Line Interface (CLI) to automate data handling
- Automate the management of instrument data with Globus
Timers and Recurring Transfers
Scheduled and recurring transfers can be reliably managed by Globus through the web app. Nightly backups to cloud storage or hourly syncs to HPC centers can be configured with a few clicks.
- Schedule a recurring transfer in the web app
- No need to maintain cron jobs or locals scripts - Let Globus reliably manage the transfer
- Configure in a few clicks and schedule large transfers when demands are low (late at night)
- Set your transfer and timer options that are right for your job
- View and manage your scheduled or recurring transfer directly in the web app
Globus Flows
Globus Flows is a foundational service for defining and executing secure, reliable automated data flows at scale. Use cases abound where automation is required to cope with growing data volumes and velocities like those generated by cryo-electron microscopes, next generation sequencers, and advanced light sources.
- Define and execute secure, reliable automated data flows at scale
- Replicate data across multiple storage systems
- Manage intricate multi-step conditional data analysis and results with human in-the-loop review and confirmation
Globus Flows define actions that are executed in a specified order using “action providers” to act on resources both on the Globus platform and externally managed services. We encourage users to extend the capabilities of Globus Flows by creating action providers to interface with their own applications. For example, if you have developed a service to perform some domain-specific analysis, you can add an action provider that allows users to access the service and incorporate it into broader automated flow. The Globus Flows service scales and performs reliably by leveraging Step Functions on Amazon Web Services.
Explore Globus Flows
- Start by experimenting with Flows using Jupyter notebooks
- Check out this SDK that provides a CLI and a convenient Pythonic interface to the suite of Globus automation services
- A guide for authoring Globus flows
- Globus SDK for Python: Documentation · Source Code
Command Line Interface
The CLI provides an interface to Globus services from the shell, and is suited to both interactive and simple scripting use cases. It is open source and available at https://github.com/globus/globus-cli