{"Gluex":{"description":"\n[GlueX](http://www.gluex.org) is an experiment at the Thomas Jefferson National Accelerator\nFacility (JLab) in Newport News, Virginia, that studies how particles called mesons behave\nto learn more about the strong force—the force that holds atomic nuclei together. The dataset\nfrom GlueX comes from millions of collisions between high-energy photons and protons. GlueX\nuses the OSDF to distribute inputs to its data simulations and is exploring using OSDF for\nreprocessing.\n\nGlueX is supported by the US Department of Energy.\n","organization":"Jefferson National Laboratory","dataVisibility":"public","size":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"GlueX","namespace":["/gluex","/Gluex"],"thirtyDayReads":0,"oneYearReads":215894157976,"publicObject":null,"organizationUrl":"https://www.jlab.org","repositoryUrl":{"url":"http://www.gluex.org"}},"VDC-PUBLIC":{"description":"Experiments related to the Virtual Data Collaboratory at the Scientific\nComputing and Imaging Institute at the University of Utah.\n\nThese cyberinfrastructure experiments include activities like running automated\nworkflows on the OSPool triggered on alerts from the [EarthScope Consortium](https://www.earthscope.org/).\n","organization":"University of Utah","dataVisibility":"public","size":null,"fieldOfScience":"Computer and Information Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"Virtual Data Collaboratory","namespace":["/VDC/PUBLIC"],"thirtyDayReads":0,"oneYearReads":12609677166,"publicObject":null,"organizationUrl":"https://www.utah.edu/","repositoryUrl":{"url":"https://par.nsf.gov/servlets/purl/10187417"}},"aws-opendata":{"description":"**[AWS Open Data](https://aws.amazon.com/opendata/)** hosts\npublicly accessible datasets covering areas such as earth science, climate,\ngenomics, machine learning, transportation, and economics. The collection\nincludes contributions from a range of organizations, including government\nagencies, academic institutions, and private companies.\n\nThere are currently nearly **700 datasets**, totaling over **100 petabytes of data**.\n\nBrowse the full catalog at the **[Registry of Open Data on AWS](https://registry.opendata.aws)**.\n\nThe AWS Open Data datasets are publicly accessible and are integrated with the OSDF, allowing\nusers to stage the data closer to nationally-funded computing resources via the OSDF's\nhardware infrastructure. This enables fusion between AWS Open Data and other data sources\naccessible via the OSDF.\n","organization":"Amazon Web Services, Inc.","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Multi/Interdisciplinary Studies.","numberOfDatasets":690,"rank":10,"inprogress":false,"display":true,"name":"Amazon Web Services Open Data","namespace":["/aws-opendata/us-east-1","/aws-opendata/us-west-1","/aws-opendata/us-west-2"],"thirtyDayReads":null,"oneYearReads":null,"organizationUrl":"https://aws.amazon.com/opendata/","repositoryUrl":{"url":"https://registry.opendata.aws/","label":"Dataset Catalog"},"publicObject":"/aws-opendata/us-east-1/tcga-2-open/01063b4e-5a0d-4f11-8061-f8fd9a5f2fa5/5e3fd2aa-e0dd-4fe8-9944-05bba5d6bd91.FPKM.txt.gz"},"caida-protected":{"description":"The Center for Applied Internet Data Analysis (CAIDA) runs an \"Network Telescope\", collecting\npackets sent to a cross-section of the public Internet similarly to how a telescope collects stray\nlight.\n\nThis dataset is made available to scientists attempting to understand how activity, such as malware,\nis moving across the Internet.\n\nThe CAIDA integration with OSDF aims to stage the most recent subset of the recorded data to be\nmade available for large-scale analysis.\n","organization":"University of California, San Diego","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Computer and Information Sciences and Support Services","numberOfDatasets":null,"rank":0,"inProgress":true,"display":true,"name":"Center for Applied Internet Data Analysis","namespace":["/caida/protected"],"thirtyDayReads":0,"oneYearReads":0,"organizationUrl":"https://www.caida.org/","repositoryUrl":{"url":"https://catalog.caida.org/search?query=types=dataset","label":"Dataset Catalog"}},"chtc-specialprojects":{"description":"Staging data for CHTC collaborations with University of\nWisconsin-Madison research groups. Currently serving data\nspecifically for the Joao Dorea group.\n","organization":"University of Wisconsin-Madison","dataVisibility":"private","size":78000000000000,"objectCount":null,"fieldOfScience":"Agriculture, Agriculture Operations, and Related Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"CHTC Staging for Campus Collaborations","namespace":["/chtc/specialprojects"],"thirtyDayReads":5286894039594,"oneYearReads":5339965187224643,"organizationUrl":"https://dorealab.cals.wisc.edu/","repositoryUrl":{"url":null}},"chtc":{"description":"The Center for High Throughput Computing (CHTC), established in 2006, aims to\nbring the power of High Throughput Computing to all fields of research, and to\nallow the future of HTC to be shaped by insight from all fields.\n\nBeyond technologies and innovation and HTC through projects like\n[HTCondor](https://htcondor.org), the CHTC operates general purpose clusters for\nthe UW-Madison campus. CHTC allows researchers to stage their research data\nto an object store connected to the OSDF and then process and analyze the data using\nthe OSDF with on-campus resources or the [OSPool](https://osg-htc.org/ospool).\n\nThis data is organized as \"working datasets\" representing running workloads, not\npermanent scientific outputs.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":445115415442995,"objectCount":null,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"CHTC Researcher Data","namespace":["/chtc","/chtc/specialprojects"],"thirtyDayReads":118701425179211,"oneYearReads":5341756039548910,"organizationUrl":"https://wisc.edu","repositoryUrl":{"url":"https://chtc.cs.wisc.edu/"}},"eic":{"description":"The Electron-Ion Collider is a proposed facility being built at\nthe Brookhaven National Laboratory. Experiments at the facility\ninclude the ePIC detector. The computing for EIC is a joint collaboration\nwith the [Jefferson National Lab](https://www.jlab.org/eic); the datasets\nconnected to the OSDF include input files and other information necessary\nto help with simulations of the detector's behavior.\n","organization":"Jefferson National Laboratory","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"Electron-Ion Collider Simulations","namespace":["/eic"],"thirtyDayReads":0,"oneYearReads":4120,"organizationUrl":"https://www.jlab.org/eic","repositoryUrl":{"url":null}},"envistor":{"description":"The South Florida region is home to nearly 10 million people, and the population is growing. The region faces several challenges,\nsuch as rising sea levels and flooding, harmful algae blooms, water contamination, and wildlife habit loss, which affects the economy\nand the welfare of its population. Florida International University (FIU) runs the EnviStor project, which is a centrally managed,\npetabyte-scale storage system that is also a clearing house for supporting interdisciplinary research and modeling involving both built\nand natural environments in South Florida. EnviStor provides opportunities for students\nand faculty to enhance their knowledge of database management, focusing on interoperability.\n\nThe datasets kept in EnviStor can be accessed via the OSDF; work is ongoing to provide new computing workflows and AI-based dataset\ndiscovery that will help users utilize the data.\n\nThe EnviStor activity and underlying storage is funded through the [NSF Campus Cyberinfrastructure program](https://www.nsf.gov/funding/opportunities/cc-campus-cyberinfrastructure) under\n[Award # 2322308](https://www.nsf.gov/awardsearch/showAward?AWD_ID=2322308).\n","organization":"Florida International University","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Natural Resources and Conservation","numberOfDatasets":null,"rank":0,"inProgress":true,"display":true,"name":"EnviStor","namespace":["/envistor"],"thirtyDayReads":36,"oneYearReads":95684864,"organizationUrl":"https://www.fiu.edu/","repositoryUrl":{"url":"https://www.cis.fiu.edu/kfscis-professor-awarded-500000-nsf-grant-for-environment/"}},"et-gw-PUBLIC":{"description":"Simulation data used for the Einstein Telescope Mock Data Challenge.\n\nThe [Einstein Telescope](https://www.et-gw.eu/) (ET) is a proposed next-generation gravitational wave\nobservatory, aiming to detect gravitational waves with much higher\nsensitivity than either the LIGO or VIRGO instruments.\n\nAs part of the studies and the design proposal for the ET instrument, the\nmock data challenge is being run in 2024 and 2025 to better understand how the future data\nmay be distributed and analyzed. An example tutorial for using the data can\nbe found [on GitHub](https://github.com/elenacuoco/ET-MDC-Tutorials).\n","organization":"UCLouvain","dataVisibility":"public","size":6764573491200,"bytesXferd":null,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":3,"rank":0,"inProgress":false,"display":true,"name":"Einstein Telescope Simulations","namespace":["/et-gw/PUBLIC"],"thirtyDayReads":16372877593,"oneYearReads":359946611774579,"publicObject":"/et-gw/PUBLIC/MDC1/v2/data/E1/E-E1_STRAIN_DATA-1000008192-2048.gwf","organizationUrl":"https://www.uclouvain.be/en","repositoryUrl":{"url":"http://et-origin.cism.ucl.ac.be/"}},"example":{"description":"This is a cool description.\n\n# Screeeeeeeeeam\n","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"organizationUrl":null},"fdp-hpe":{"description":"The [Fusion Data Platform](https://github.com/Fusion-Data-Platform/fdp) (FDP) provides\na modern, Python-based data framework for analyzing data from magnetic fusion experiments.\n\nUsing data from the [DIII-D National Fusion Facility](https://www.ga.com/magnetic-fusion/diii-d),\nusers can leverage the FDP software to stream data via the OSDF services for their fusion data\nanalysis.\n\nThe FDP is funded by the DOE under award\n[DE-SC0024426](https://pamspublic.science.energy.gov/WebPAMSExternal/Interface/Common/ViewPublicAbstract.aspx?rv=5b18d4f7-1f1a-4858-b35a-1040e0f1900a&rtc=24&PRoleId=10).\n","organization":"General Atomics","dataVisibility":"private","size":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"DII-D National Fusion Facility","namespace":["/fdp-hpe","/nrp/fdp"],"thirtyDayReads":0,"oneYearReads":553211044,"organizationUrl":"https://fdp.readthedocs.io/en/latest/user_guide.html","repositoryUrl":{"url":null}},"gwdata":{"description":"Public gravitational wave data from international gravitational wave network,\nincluding data from [LIGO](https://www.ligo.caltech.edu/), [VIRGO](https://www.virgo-gw.eu/),\nand [KAGRA](https://gwcenter.icrr.u-tokyo.ac.jp/en/). This data can be used\nin the detection and study of black holes throughout the universe.\n\nThese datasets are the calibrated readouts from the corresponding interferometers.\nAlso included are mirrors of data analysis products released to Zenodo to\naccompany publications.\n","organization":"California Institute of Technology","dataVisibility":"public","size":77032444311984,"objectCount":1290210,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":70,"rank":3,"inProgress":false,"display":true,"name":"Gravitational Wave Open Science Center","namespace":["/gwdata"],"thirtyDayReads":58523540450000,"oneYearReads":368065653158180,"publicObject":"/gwdata/zenodo/ligo-virgo-kagra/index.txt","organizationUrl":"https://gwosc.org/","repositoryUrl":null},"icecube-PUBLIC":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"namespace":["/icecube/PUBLIC"],"thirtyDayReads":138632907277119,"oneYearReads":8172086399139906,"organizationUrl":null},"icecube":{"description":"The **IceCube repository** integrates data from the [*IceCube Neutrino Observatory*](https://icecube.wisc.edu),\na cubic-kilometer detector embedded deep in Antarctic ice near the South Pole. IceCube records when high-energy\nneutrinos interact with the ice. \n\nUsing over 5,000 optical sensors deployed between 1,450 and 2,450 meters below the surface, the observatory\ncaptures detailed information about these events, including their timing, location, and intensity. The data\nis used to study cosmic neutrinos and the astrophysical phenomena that produce them, such as **black holes**,\n**supernovae**, and **gamma-ray bursts**.\n\nThe IceCube collaboration is supported by [multiple funding agencies](https://icecube.wisc.edu/collaboration/funding/) including\nthe [NSF](https://www.nsf.gov/awardsearch/showAward?AWD_ID=2042807). The dataset is maintained by the\nWisconsin Icecube Particle Astrophysics Center.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"IceCube Neutrino Data","namespace":["/icecube","/icecube/PUBLIC"],"thirtyDayReads":139221399022659,"oneYearReads":7727299876651540,"organizationUrl":"https://wipac.wisc.edu"},"igwn-cit":{"description":"User-managed data by members of the [LIGO Scientific Collaboration](www.ligo.org), the\n[Virgo Collaboration](https://www.virgo-gw.eu/), and the [KAGRA Collaboration](https://gwcenter.icrr.u-tokyo.ac.jp/en/organization).\nThese data are created and used within individual users' workflows as they analyze gravitational-wave data in order\nto detect black hole collisions and other cosmic phenomena. This origin is hosted at Caltech.\n\nThis data is not public; it is in support of in-progress computational workflows.\n","organization":"California Institute of Technology","dataVisibility":"private","size":31537610933565,"objectCount":469323,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":1581,"rank":0,"inProgress":false,"display":true,"name":"Caltech Gravitational Wave Data","namespace":["/igwn/cit"],"thirtyDayReads":769728653555870,"oneYearReads":17688932216985000,"organizationUrl":"https://www.ligo.caltech.edu/","repositoryUrl":null},"igwn-kagra":{"description":"Gravitational wave data collected by the [KAGRA interferometer](https://www.virgo-gw.eu/),\na scientific device for detecting gravitational waves in the Gifu prefecture in Japan. KAGRA\ncollaborates closely with the LIGO detectors in the US to provide more accurate\ndetection of gravitational waves\n\nThis is the data not yet released to the public.\n","organization":"University of Tokyo","dataVisibility":"private","size":8246337208320,"objectCount":59849,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":2,"rank":0,"inProgress":false,"display":true,"name":"KAGRA Gravitational Wave Data","namespace":["/igwn/kagra"],"thirtyDayReads":0,"oneYearReads":449373434362,"organizationUrl":"https://www.u-tokyo.ac.jp/en/","repositoryUrl":{"url":"https://gwcenter.icrr.u-tokyo.ac.jp/en"}},"igwn-ligo":{"description":"Gravitational wave data collected by the [LIGO interferometer](https://www.ligo.caltech.edu/page/ligos-ifo)\ndetectors in Hanford, Washington and Livingston, Louisiana and hosted by \n[LIGO Laboratory](https://www.ligo.caltech.edu/) at Caltech. Gravitational wave data is used to detect\nblack hole collisions and other cosmic phenomena and is one piece of the NSF's multi-messenger astronomy\ninitiatives.\n\nThis is the data not yet released to the public.\n","organization":"California Institute of Technology","dataVisibility":"private","size":138805574893272,"objectCount":245785,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":12,"rank":0,"inProgress":false,"display":true,"name":"LIGO Gravitational Wave Data","namespace":["/igwn/ligo","/user/ligo"],"thirtyDayReads":3894337423325970,"oneYearReads":22558599596480200,"organizationUrl":"https://www.ligo.caltech.edu/","repositoryUrl":{"url":"https://www.ligo.caltech.edu"}},"igwn-shared":{"description":"Curated datasets used by members of the [LIGO Scientific Collaboration](www.ligo.org), the\n[Virgo Collaboration](https://www.virgo-gw.eu/), and the [KAGRA Collaboration](https://gwcenter.icrr.u-tokyo.ac.jp/en/organization)\nin the combined analysis of data collected from their detectors. These data consist of gravitational-wave\ndata collected at any of the four interferometers but with simulated signals, as well as some other datasets,\nused for data analysis purposes in detecting black hole collisions and other cosmic phenomena as\npart of the NSF's multi-messenger astronomy initiatives.\n\nThese data are not yet released to the public.\n","organization":"California Institute of Technology","dataVisibility":"private","size":100695746686856,"objectCount":1033379,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":5,"rank":0,"inProgress":false,"display":true,"name":"IGWN Shared Gravitational Wave Data","namespace":["/igwn/shared"],"thirtyDayReads":4436999054511,"oneYearReads":2668375776814160,"organizationUrl":"https://www.ligo.caltech.edu/","repositoryUrl":null},"igwn-test-write":{"description":"This is a test repository utilized by staff of the [LIGO Laboratory](https://www.ligo.caltech.edu/) at\nCaltech to test new versions ofthe [Pelican](https://pelicanplatform.org/) software and configuration, to ensure that upcoming changes\ndo not disrupt ongoing data analysis on any of the production origins. This test origin specifically\ntests the software and configuration of user-managed data analogous to that served in /igwn/cit.\n\nThis data is private.\n","organization":"California Institute of Technology","dataVisibility":"private","size":443967241,"objectCount":47,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":2,"rank":0,"inProgress":false,"display":false,"name":"IGWN Write Test","namespace":["/igwn/test-write"],"thirtyDayReads":0,"oneYearReads":468,"organizationUrl":"https://www.ligo.caltech.edu/","repositoryUrl":null},"igwn-test":{"description":"This is a test namespace utilized by staff of the [LIGO Laboratory](https://www.ligo.caltech.edu/) at\nCaltech to test new versions of [Pelican](https://pelicanplatform.org/) software and configuration, to ensure that upcoming changes\ndo not disrupt ongoing data analysis on any of the production origins.\n\nThis data is private.\n","organization":"California Institute of Technology","dataVisibility":"private","size":15322207341,"objectCount":1066,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":2,"rank":0,"inProgress":false,"display":false,"name":"IGWN Read Test","namespace":["/igwn/test"],"thirtyDayReads":0,"oneYearReads":2101389,"organizationUrl":"https://www.ligo.caltech.edu/","repositoryUrl":null},"igwn-virgo":{"description":"Gravitational wave data collected by the [VIRGO interferometer](https://www.virgo-gw.eu/),\na scientific device for detecting gravitational waves near Pisa, Italy. VIRGO\ncollaborates closely with the LIGO detectors in the US to provide more accurate\ndetection of gravitational waves\n\nThis is the data not yet released to the public.\n","organization":"European Gravitational Observatory","dataVisibility":"private","size":16106127360000,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":10,"rank":0,"inProgress":false,"display":true,"name":"VIRGO Gravitational Wave Data","namespace":["/igwn/virgo"],"thirtyDayReads":269563925490601,"oneYearReads":3046944641469200,"organizationUrl":"https://www.ego-gw.it/","repositoryUrl":{"url":"https://www.virgo-gw.eu"}},"jkb-lab-public":{"description":"Jessica Kendall-Bar leads a research group that integrates engineering, data science, ecology, and\nvisual storytelling/public communication to explore the behavior and physiology of marine life.\n\nHer visual data work has appeared in various media platforms—from UC San Diego news to national outlets\nlike The New York Times and The Atlantic—and has contributed to global policy efforts in areas such as\nmarine mammal protection and coral reef recovery.\n","organization":"University of California, San Diego","dataVisibility":"public","size":null,"bytesXferd":null,"url":"https://www.jessiekb.com/","fieldOfScience":"MULTI/INTERDISCIPLINARY STUDIES","numberOfDatasets":null,"rank":0,"inProgress":true,"display":false,"name":"Jessica Kendall-Bar Lab","namespace":["/jkb-lab-public"],"thirtyDayReads":0,"oneYearReads":130362,"publicObject":null,"organizationUrl":"https://www.jessiekb.com/"},"jkb-lab":{"description":"Jessica Kendall-Bar leads a research group that integrates engineering, data science, and ecology\nto explore the behavior and physiology of marine life. The data stored on the OSDF includes high-resolution\nmultimodal data such as video, GPS, and electrophysiology.\n\nThe OSDF data is catalogued on the [National Data Platform](https://nationaldataplatform.org/), enabling\ntextual, conceptual, and map-based spatiotemporal search capabilities.\n\nThe NDP project is using this dataset as inputs for a data challenge planned for Fall 2025. It also\npowers an application running on the [National Research Platform](https://nationalresearchplatform.org/)\nat .\n","organization":"University of California, San Diego","dataVisibility":"private","size":5497558138880,"objectCount":null,"fieldOfScience":"Biological and Biomedical Sciences","numberOfDatasets":null,"rank":0,"inProgress":true,"display":true,"name":"Jessica Kendall-Bar Lab","namespace":["/jkb-lab","/jkb-lab-public"],"thirtyDayReads":0,"oneYearReads":1266,"organizationUrl":"https://ucsd.edu/","repositoryUrl":{"url":"https://nationaldataplatform.org/ckandata","label":"Dataset Catalog"}},"jlab-osdf":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/jlab-osdf"],"thirtyDayReads":48830444442602,"oneYearReads":50865203251663,"organizationUrl":null},"jlab":{"description":"The Jefferson National Laboratory (JLab) operates particle accelerator\nfacilities and associated detectors for experiments like\n[GlueX](https://www.gluex.org/).\n\nJLab connects its storage to the OSDF to allow large-scale data simulation\nand reprocessing on the PATh-operated [OSPool resources](https://osg-htc.org/services/ospool/) and JLab-provided\ncapacity.\n","organization":"Jefferson National Laboratory","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"JLab Simulation Datasets","namespace":["/jlab"],"thirtyDayReads":0,"oneYearReads":578041482718610,"organizationUrl":"https://www.jlab.org/","repositoryUrl":null},"kennesaw-priv":{"description":"This repository enables faculty and students at Kennesaw State University to use their\n[NSF Campus Cyberinfrastructure (CC*)](https://www.nsf.gov/funding/opportunities/cc-campus-cyberinfrastructure)\nfunded [storage](https://www.kennesaw.edu/research/centers-facilities/center-research-computing/nsf-campus-cyberinfrastructure/data-storage-project.php) ([Award #2430289](https://www.nsf.gov/awardsearch/showAward?AWD_ID=2430289&HistoricalAwards=false))\nwith their local HPC cluster via OSDF.\n","organization":"Kennesaw State University","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":true,"display":true,"name":"Kennesaw State University CC* Storage","namespace":["/kennesaw-priv","/kennesaw"],"thirtyDayReads":46,"oneYearReads":7708,"organizationUrl":"https://www.kennesaw.edu/","repositoryUrl":null},"kennesaw":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/kennesaw"],"thirtyDayReads":1322040,"oneYearReads":1322040,"publicObject":null,"organizationUrl":null},"knightlab":{"description":"The Knight Lab uses and develops state-of-the-art computational and experimental\ntechniques to ask fundamental questions about the evolution of the composition of biomolecules,\ngenomes, and communities in different ecosystems, including the complex microbial ecosystems of the human body. \n","organization":"University of California, San Diego","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Biological and Biomedical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"UCSD Knight Lab","namespace":["/knightlab"],"thirtyDayReads":0,"oneYearReads":258218216565,"publicObject":null,"organizationUrl":"https://knightlab.ucsd.edu/","repositoryUrl":null},"mals":{"description":"The MeerKAT Absorption Line Survey (MALS) consists of 1,655 hours of observatory time on the\n[MeerKAT](https://www.sarao.ac.za/science/meerkat/about-meerkat/) radio telescope at the South\nAfrican Radio Astronomy Observatory. The survey aims\nto carry out the most sensitive search of HI and OH absorption lines at 0\n\n
\n\n*Visualization of ocean temperature on January 16, 2014.*\n\n\"Ocean\n\nThe integration between NCAR and OSDF is part of the [Pathfinders collaboration](https://ndc-pathfinders.org),\na collaboration between five initiatives aimed at developing science-led pathways through the NSF cyberinfrastructure\nlandscape. This work is funded by NSF award [1852977](https://www.nsf.gov/awardsearch/showAward?AWD_ID=1852977).\n","organization":"NSF National Center for Atmospheric Research","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":2,"inProgress":false,"display":true,"name":"NCAR Research Data Archive","namespace":["/ncar","/ncar-rda"],"thirtyDayReads":218481971416683,"oneYearReads":264353361185352,"organizationUrl":"https://ncar.ucar.edu","repositoryUrl":{"url":"https://rda.ucar.edu","label":"Dataset Catalog"},"publicObject":"/ncar/rda/d208000/index.html"},"ndp-burnpro3d-auth":{"description":"A century of suppressing wildfires has created a dangerous accumulation of flammable vegetation on landscapes,\ncontributing to megafires that risk human life and destroy ecosystems. Prescribed burns can dramatically reduce\nthe risk of large fires that are uncontrollable by decreasing this buildup of fuels. BurnPro3D is a science-driven,\ndecision-support platform to help the fire management community understand risks and tradeoffs quickly and\naccurately when planning and conducting prescribed burns.\n","organization":"University of California, San Diego","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Natural Resources and Conservation","numberOfDatasets":null,"rank":0,"inProgress":true,"display":false,"name":"BurnPro3D","namespace":["/ndp/burnpro3d-auth"],"thirtyDayReads":0,"oneYearReads":9213605566,"organizationUrl":"https://wifire.ucsd.edu/burnpro3d","repositoryUrl":{"url":"https://wifire-data.sdsc.edu/dataset","label":"Dataset Catalog"}},"ndp-burnpro3d":{"description":"A century of suppressing wildfires has created a dangerous accumulation of flammable vegetation on landscapes,\ncontributing to megafires that risk human life and destroy ecosystems. Prescribed burns can dramatically reduce\nthe risk of large fires that are uncontrollable by decreasing this buildup of fuels. BurnPro3D is a science-driven,\ndecision-support platform to help the fire management community understand risks and tradeoffs quickly and\naccurately when planning and conducting prescribed burns.\n","organization":"University of California, San Diego","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Natural Resources and Conservation","numberOfDatasets":1,"rank":1,"inProgress":false,"display":true,"name":"BurnPro3D","namespace":["/ndp/burnpro3d"],"thirtyDayReads":316964599,"oneYearReads":9337358188,"publicObject":"/ndp/burnpro3d/YosemiteBurnExample/burnpro3d-yosemite-example.csv","organizationUrl":"https://wifire.ucsd.edu/burnpro3d","repositoryUrl":{"url":"https://wifire-data.sdsc.edu/dataset","label":"Dataset Catalog"}},"noaa-fisheries":{"description":"NOAA collects and uses active acoustic (or sonar) data for a variety of\nmapping requirements. Water column sonar data focus on the area from near\nthe surface of the ocean to the seafloor. Primary uses of these specific\nsonar data include 3-D mapping of fish schools and other mid-water marine\norganisms; assessing biological abundance; species identification; and\nhabitat characterization. Other uses include mapping underwater gas seeps\nand remotely monitoring undersea oil spills. NCEI archives water column\nsonar data collected by NOAA line offices, academia, industry, and\ninternational institutions.\n","organization":"National Oceanic and Atmospheric Administration","organizationUrl":"https://www.noaa.gov/","repositoryUrl":{"url":"https://www.ncei.noaa.gov/products/water-column-sonar-data"},"fieldOfScience":"Fishing and Fisheries Sciences and Management","numberOfDatasets":1,"dataVisibility":"public","publicObject":"/noaa/fisheries-1/noaa-wcsd-pds/data/raw/Henry_B._Bigelow/HB2403/README_HB2403_EK80.md","size":331978855304407,"display":true,"rank":1,"inProgress":false,"name":"NCEI Water Column Sonar Data","namespace":["/noaa/fisheries-1","/noaa/fisheries-2"],"thirtyDayReads":null,"oneYearReads":null},"nrao-ardg":{"description":"Radio astronomy data from the Very Large Array Sky Survey (VLASS).\n\nAs written in the [VLASS homepage](https://public.nrao.edu/vlass/),\nVLASS is a survey of the universe through the use of the Very Large\nArray (VLA) in New Mexico. The VLA is one of the most sensitive telescopes\nin the radio band that can provide more sensitive images of the universe\nthan any other radio telescope in the world. This, however, requires processing\nlarge volumes of data and super-computer class computing\nresources. The VLASS is designed to produce a large collection\nof radio data available to wide range of scientists within the astronomical\ncommunity. VLASS's science goal is to produce a radio, all-sky survey that\nwill benefit the entire astronomical community. As VLASS completes its three\nscans of the sky separated by approximately 32 months, new developments in\ndata processing techniques will allow scientists an opportunity to download data\ninstantly on potentially millions of astronomical radio sources.\n\nThe data in this data origin consists of interferometric visibilities stored in \n([Measurement Set (MS)](https://casa.nrao.edu/Memos/229.html)) format. Each\ndataset contains calibrated visibilities for one of the sixteen spectral windows\nof the VLA and covers an area of 4 square degrees (2 degrees x 2 degrees) in the\nsky. All sixteen spectral windows are combined to generate a single image, so that\nthe data contained in this data origin can be used to make images of approximately \n70 regions in the sky, each image covering 4 square degrees. The [LibRA software\npackage](https://github.com/ARDG-NRAO/LibRA) is used to transform visibilities to\nimages. The architecture and design considerations for LibRA are shown in [this\npresentation](https://www.aoc.nrao.edu/~sbhatnag/Talks/For_BrianB.pdf).\n\nTeams of scientists at the [National Radio Astronomy Observatory\n(NRAO)](http://www.nrao.edu), Socorro, NM and the Center for High Throughput Computing\n(CHTC) have used the PATh and NRP facilities of the OSG to make the\ndeepest image in the radio band of the Hubble Ultra-deep Field\n(HUDF). Similarly, the COSMOS HI Large Extra Galactic Survey\n(CHILES)[http://chiles.astro.columbia.edu/] project has 1000 hr of integration with the VLA on the\nCOSMOS field. Imaging the CHILES data using PATh and NRP facilities\ndelivered the deepest radio image of this region of the sky, at an\nunmatched data processing throughput. Similarly to the VLASS data stored in this\ndata origin, the data for HUDF and CHILES is stored in the PATh facility data origin.\nThese recent large scale imaging achievements that were made possible through\nuse of OSG resources are reported in this [NRAO Newsletter article]\n(https://science.nrao.edu/enews/17.3/index.shtml#deepimaging) and [this press\nrelease](https://public.nrao.edu/news/astronomers-study-the-universe-300-times-faster/).\n","organization":"National Radio Astronomy Observatory","dataVisibility":"public","size":4068193022771,"objectCount":36068,"fieldOfScience":"Astronomy and Astrophysics","numberOfDatasets":15962,"rank":0,"inProgress":false,"display":true,"name":"NRAO VLASS","namespace":["/nrao-ardg"],"thirtyDayReads":16440925390027,"oneYearReads":280427321936701,"organizationUrl":"https://public.nrao.edu/vlass/","repositoryUrl":null,"publicObject":"/nrao-ardg/fmadsen/vlass-32PIMS/data/T23t17/J161533+503000/VLASS2.1.sb38528342.eb38565674.59072.03519471065_split_SPW0.ms.tgz"},"nrp-cachetest":{"description":"Namespace used by Fabio for ongoing CheckMK testing of NRP caches\n","organization":"University of California, San Diego","dataVisibility":"public","size":null,"objectCount":null,"fieldOfScience":"Computer and Information Sciences and Support Services","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"CheckMK Probe Namespace","namespace":["/nrp/cachetest"],"thirtyDayReads":854213001500,"oneYearReads":2395013261465260,"publicObject":null,"organizationUrl":null,"repositoryUrl":null},"nrp-osdf":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/nrp/osdf"],"thirtyDayReads":65536,"oneYearReads":1475164654373560,"publicObject":null,"organizationUrl":null},"nrp-protected-xenon-biggrid-nl":{"description":"The [XENON Dark Matter Project](https://xenonexperiment.org/) is a scientific\ncollaboration organized around the XENONnT dark matter detector at the INFN\n[Gran Sasso National Laboratory](https://www.lngs.infn.it/en/lngs-overview) in\nGran Sasso, Italy.\n\nThis repository is used to store data and simulations from the XENONnT experiment\nto aid in its computing workloads.\n","organization":"University of Chicago","dataVisibility":"private","size":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"XENONnT Dark Matter","namespace":["/nrp/protected/xenon-biggrid-nl/"],"thirtyDayReads":0,"oneYearReads":152114667948168,"organizationUrl":"https://www.uchicago.edu/en","repositoryUrl":{"url":"https://xenonexperiment.org/"}},"nrp-sio":{"description":"Scripps Institution of Oceanography scientists conduct fundamental research to\nunderstand and protect the planet, and investigate our oceans, Earth, and atmosphere\nto find solutions to our greatest environmental challenges.\n","organization":"Scripps Institute of Oceanography","dataVisibility":"private","size":null,"objectCount":null,"fieldOfScience":"Biological and Biomedical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"Scripps Institute of Oceanography","namespace":["/nrp/sio"],"thirtyDayReads":38384294620,"oneYearReads":38428297113,"organizationUrl":"https://scripps.ucsd.edu/","repositoryUrl":null},"nsdf":{"description":"","organization":"Morgridge Institute for Research","dataVisibility":"public","size":null,"bytesXferd":null,"url":"https://morgridge.org/research/research-computing/","fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/nsdf"],"thirtyDayReads":5692199277,"oneYearReads":51536059619,"publicObject":null,"organizationUrl":"https://morgridge.org/research/research-computing/"},"osdf-tutorial":{"description":"Datasets for use in OSDF usage tutorials by [Pelican Platform](https://pelicanplatform.org) facilitation team.\n\nThis repository supports the education and workforce development mission of the\nPelican Project.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":2416,"fieldOfScience":"Computer and Information Sciences","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"OSDF Tutorial Data","namespace":["/osdf-tutorial"],"thirtyDayReads":0,"oneYearReads":62261,"organizationUrl":"https://wisc.edu/","repositoryUrl":{"url":"https://github.com/osg-htc/tutorial-osdf-noaa/blob/main/01-get-and-share-objects.ipynb"}},"osn-sdsc":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/osn-sdsc"],"thirtyDayReads":67371008,"oneYearReads":373263780714960,"organizationUrl":null},"ospool-ap20-ap21":{"description":"Staging area for [PATh](https://path-cc.io/)-operated Access Points located at the University of Chicago.\n\nThe PATh project allows researcher teams to stage their research data\nto an object store connected to the OSDF and then process and analyze the data using\nthe OSDF via the [OSPool](https://osg-htc.org/ospool). Any US-based open science team\ncan utilize the PATh services for distributed High Throughput Computing workflows.\n\nThis data is organized as \"working datasets\" representing running workloads, not\npermanent scientific outputs.\n","organization":"University of Chicago","dataVisibility":"private","size":39000000000000,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"OSPool AP Working Data","namespace":["/ospool/ap20","/ospool/ap21"],"thirtyDayReads":2700344994763630,"oneYearReads":44089309792954600,"organizationUrl":"https://www.uchicago.edu/en","repositoryUrl":{"url":"https://osg-htc.org/services/ospool/"}},"ospool-ap22":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":null,"namespace":["/ospool/ap22"],"thirtyDayReads":0,"oneYearReads":91406519414,"organizationUrl":null},"ospool-ap40-data":{"description":"Staging area for [PATh](https://path-cc.io/)-operated Access Points located at the University of Wisconsin-Madison.\n\nThe PATh project allows researcher teams to stage their research data\nto an object store connected to the OSDF and then process and analyze the data using\nthe OSDF via the [OSPool](https://osg-htc.org/ospool). Any US-based open science team\ncan utilize the PATh services for distributed High Throughput Computing workflows.\n\nThis data is organized as \"working datasets\" representing running workloads, not\npermanent scientific outputs.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":7460424841729,"bytesXferd":null,"url":"https://osg-htc.org/services/ospool/","fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":true,"name":"OSPool AP Working Data","namespace":["/ospool/ap40/data"],"thirtyDayReads":1529524963342780,"oneYearReads":11140957788277200,"organizationUrl":"https://www.wisc.edu/"},"ospool-uc-shared-project":{"description":"Staging area for [PATh](https://path-cc.io/)-operated collaboration services located at the University of Chicago.\n\nThe PATh project allows multi-institutional collaborations to stage their experimental data\nand simulation outputs to an object store connected to the OSDF and then process and analyze the data using\nthe OSDF via the [OSPool](https://osg-htc.org/ospool) or other capacity dedicated to their\nexperiment.\n\nThis data is organized as \"working datasets\" representing running workloads, not\npermanent scientific outputs.\n","organization":"University of Chicago","dataVisibility":"private","size":193000000000000,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"PATh Collaboration Services","namespace":["/ospool/uc-shared/project","/ospool/uc-shared/public"],"thirtyDayReads":21723475821543,"oneYearReads":6775986102458060,"organizationUrl":"https://www.uchicago.edu/en","repositoryUrl":{"url":"https://osg-htc.org/collaboration-support/"},"publicObject":"/ospool/uc-shared/public/eht/GRMHD_kharma-v3/Ma+0.94_w5/torus.out0.05986.h5"},"ospool-uc-shared-public":{"description":"Data staging area for OSPool projects with public data\n","organization":"University of Chicago","dataVisibility":"public","size":10600000000000,"bytesXferd":null,"url":"https://osg-htc.org/services/ospool/","fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"OSPool Public Staging","namespace":["/ospool/uc-shared/public"],"publicObject":null,"thirtyDayReads":1325265802131560,"oneYearReads":2426417531645050,"organizationUrl":"https://osg-htc.org/services/ospool/"},"path-facility-data":{"description":"Staging area for data used in the [PATh Facility](https://path-cc.io/facility/index.html).\nThe PATh Facility is a distributed computing resource spanning 5 sites, from San Diego, California\nto Syracuse, New York, that provides NSF-funded researches with compute credits for High Throughput\nComputing workflows.\n\nThis repository enables these NSF projects to stage their research data\noutputs to an object store connected to the OSDF and then process and analyze the data using\nthe OSDF via both the PATh Facility computing hardware and the [OSPool](https://osg-htc.org/ospool).\n\nThis data is organized as \"working datasets\" representing active workloads from researchers, not\npermanent scientific outputs.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":13958398725,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"PATh Facility Researcher Data","namespace":["/path-facility/data"],"thirtyDayReads":7134824038133,"oneYearReads":1241810445184950,"organizationUrl":"https://wisc.edu/"},"path-facility-projects":{"description":"Special projects data in the PATh facility.\n\nTo avoid redundancy, focus on `/path-facility/data` instead.\n","organization":"University of Wisconsin - Madison","dataVisibility":"private","size":71762120,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"namespace":["/path-facility/projects"],"thirtyDayReads":29523415051,"oneYearReads":681346253116,"organizationUrl":"https://wisc.edu","repositoryUrl":null},"pelican-monitoring":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"namespace":["/pelican/monitoring"],"thirtyDayReads":483963111,"oneYearReads":6313498957,"organizationUrl":null},"pelicanfacilitation":{"description":"A namespace for the [Pelican Platform](https://pelicanplatform.org/) facilitation team to use for a variety of facilitation purposes.\n","organization":"University of Wisconsin-Madison","dataVisibility":"public","size":null,"fieldOfScience":"Computer and Information Sciences and Support Services","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"Pelican Facilitation Test Data","namespace":["/pelicanfacilitation"],"thirtyDayReads":0,"oneYearReads":0,"publicObject":null,"organizationUrl":"https://wisc.edu","repositoryUrl":null,"publicObjectUrl":null},"pelicanplatform":{"description":"Testing and Validation Origin\n","organization":null,"dataVisibility":"public","publicObject":null,"size":8132,"bytesXferd":null,"url":"https://pelicanplatform.org","fieldOfScience":"Computer and Information Sciences and Support Services","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"namespace":["/pelicanplatform"],"thirtyDayReads":61976849721,"oneYearReads":9200333527280,"organizationUrl":"https://pelicanplatform.org"},"pnfs-fnalgov-des":{"description":"The Dark Energy Survey (DES) will probe the origin of the accelerating universe and help uncover the nature of dark energy\nby measuring the 14-billion-year history of cosmic expansion with high precision. A 570M-pix camera, the DECam, is being\nbuilt for this project and comprehensive tests were successfully accomplished at Fermilab's telescope simulator (pictured above).\nAs we countdown to DECam's first light, workload and excitement increase among our collaborators. Starting in late 2011 and\ncontinuing for five years, DES will survey a large swath of the southern sky out to vast distances in order to provide new clues\nto this most fundamental of questions.\n\nDES uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":136000000000,"objectCount":36656,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"Dark Energy Survey","namespace":["/pnfs/fnal.gov/usr/des"],"thirtyDayReads":null,"oneYearReads":null,"publicObject":"/pnfs/fnal.gov/usr/des/persistent/stash/gw/ALLWISE_AGN/allwiseagn_v1_082022.dat","organizationUrl":"https://www.fnal.gov","repositoryUrl":{"url":"https://astro.fnal.gov/the-des-project/"}},"pnfs-fnalgov-dune":{"description":"The Deep Underground Neutrino Experiment is an international flagship experiment to unlock the mysteries of neutrinos.\nDUNE scientists will paint a clearer picture of the universe and how it works. Their research may even give us the key\nto understanding why we live in a matter-dominated universe — in other words, why we are here at all.\n\nDUNE will pursue three major science goals: find out whether neutrinos could be the reason the universe is made of matter;\nlook for subatomic phenomena that could help realize Einstein's dream of the unification of forces; and watch for neutrinos\nemerging from an exploding star, perhaps witnessing the birth of a neutron star or a black hole.\n\nDUNE uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":7400000000000,"objectCount":1407846,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"DUNE","namespace":["/pnfs/fnal.gov/usr/dune"],"publicObject":"/pnfs/fnal.gov/usr/dune/persistent/stash/Flux/Supernova/v1/gvkm_nue_spectrum.root","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://lbnf-dune.fnal.gov/"}},"pnfs-fnalgov-icarus":{"description":"The ICARUS neutrino detector measures 65 feet long and weighs 760 tons. It began its life in [Gran Sasso Laboratory](https://www.lngs.infn.it/en/lngs-overview) in\nItaly, seeking out elusive particles using pioneering technology. It later spent two years undergoing upgrades at [CERN](https://cern.ch/),\nthe European particle physics laboratory and home of the Large Hadron Collider. It moved to Fermilab in 2017 and was\ninstalled in its detector hall in 2018, where along with the new Cosmic Ray Tagger it forms the far detector for the\nShort-Baseline Neutrino program.\n\nThe ICARUS collaboration is investigating signs of physics that may point to a new kind of neutrino called the sterile\nneutrino. Other experiments have made measurements that suggest a departure from the standard three-neutrino model. ICARUS\nis also investigating the various probabilities of a neutrino interacting with different types of matter as well as\nneutrino-related astrophysics topics.\n\nICARUS uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":32000000000,"objectCount":32767,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"ICARUS","namespace":["/pnfs/fnal.gov/usr/icarus"],"publicObject":null,"organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://icarus.fnal.gov/"}},"pnfs-fnalgov-minerva":{"description":"MINERvA (Main Injector Neutrino ExpeRiment to study v-A interactions) is the first neutrino experiment in the\nworld to use a high-intensity beam to study neutrino reactions with five different nuclei, creating the first\nself-contained comparison of interactions in different elements. While this type of study has previously been\ndone using beams of electrons, this is a first for neutrinos.\n\nMINERvA is providing the world's best, high-precision measurements of neutrino interactions on various nuclei,\nin the 1-to 10-GeV energy range. MINERvA's results are being used as inputs to current and future experiments\nseeking to study neutrino oscillations, or the ability of neutrinos to change their type.\n\nMINERvA uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":34500000000,"objectCount":35339,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"MINERvA","namespace":["/pnfs/fnal.gov/usr/minerva"],"publicObject":"/pnfs/fnal.gov/usr/minerva/persistent/stash/mc_generation_flux/mc-flux/mc/g4numiv6/00/00/00/06/g4numiv6_dk2nu_minervamebar_me000z-200i_0000_0006.root","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://minerva.fnal.gov/"}},"pnfs-fnalgov-nova":{"description":"The NOvA (NuMI Off-axis ve Appearance) experiment is shedding light on one of nature's most elusive particles: neutrinos.\nSince the late 1990s, physicists have known that neutrinos exhibit a quantum mechanical behavior called oscillations. But\nthis behavior is not predicted by the Standard Model of particle physics. NOvA is working to better understand these strange\nparticles through precision measurements of their oscillation properties.\n\nNOvA uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":10800000000000,"objectCount":535751,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"NOVA","namespace":["/pnfs/fnal.gov/usr/nova"],"publicObject":"/pnfs/fnal.gov/usr/nova/persistent/stash/flux/g4numi/v6r1b/me000z200i/g4numiv6_minervame_me000z200i_0_0005.root","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://novaexperiment.fnal.gov/"}},"pnfs-fnalgov-sbn":{"description":"The international Short-Baseline Neutrino Program at Fermilab examines the properties of neutrinos,\nspecifically how the flavor of a neutrino changes as it moves through space and matter. The program\nemerged from a joint proposal, submitted by three scientific collaborations, to use particle detectors\nto perform sensitive searches for ve appearance and νμ disappearance in the Booster Neutrino Beam. All\nof the detectors are types of liquid-argon time projection chambers, and each contributes to the\ndevelopment of this particle detection technology for the long-baseline Deep Underground Neutrino Experiments\n(DUNE).\n\nSBN uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":761000000000,"objectCount":78747,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"Short-Baseline Neutrino Program","namespace":["/pnfs/fnal.gov/usr/sbn"],"publicObject":"/pnfs/fnal.gov/usr/sbn/persistent/stash/physics/beam/GENIE/BNB/standard/v01_00/converted_beammc_icarus_0113.root","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://sbn.fnal.gov/"}},"pnfs-fnalgov-sbnd":{"description":"The [Short-Baseline Near Detector](https://sbn-nd.fnal.gov/) (SBND) is a 112-ton active mass liquid argon time projection chamber (LArTPC)\nneutrino detector that sits only 110-m from the target of the Booster Neutrino Beam (BNB) at Fermilab. SBND is\nthe near detector in the Short-Baseline Neutrino Program. ICARUS is the far detector in the program, and\nMicroBooNE ran previously in the same beam.\n\nSBND will record over a million neutrino interactions per year. By providing such a high statistics measurement\nof the un-oscillated content of the BNB, SBND plays a critical role in performing searches for neutrino oscillations\nat the SBN Program. The large data sample will also allow studies of neutrino-argon interactions in the GeV energy\nrange with unprecedented precision. The physics of these interactions is an important element of future neutrino\nexperiments that will employ the LArTPC technology, such as the long-baseline Deep Underground Neutrino Experiment, [DUNE](https://www.dunescience.org/).\n\nSBND uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":503000000000,"objectCount":60125,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"Short-Baseline Near Detector","namespace":["/pnfs/fnal.gov/usr/sbnd"],"publicObject":"/pnfs/fnal.gov/usr/sbnd/persistent/stash/fluxFiles/bnb/BooNEtoGSimple/configK-v1/july2023/neutrinoMode/gsimple_april07_baseline_0019_redecay_wkaonwgh.root","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://sbn-nd.fnal.gov/"}},"pnfs-fnalgov-uboone":{"description":"MicroBooNE is a large 170-ton liquid-argon time projection chamber (LArTPC) neutrino experiment located on the Booster\nneutrino beamline at Fermilab. The experiment first started collecting neutrino data in October 2015.\n\nMicroBooNE investigates the low energy excess events observed by the MiniBooNE experiment, measure a suite of low\nenergy neutrino cross sections, and investigate astro-particle physics.\n\nMicroBooNE uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.\n","organization":"Fermi National Accelerator Laboratory","dataVisibility":"public","size":3500000000000,"objectCount":356300,"fieldOfScience":"Physical Sciences","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"MicroBooNE","namespace":["/pnfs/fnal.gov/usr/uboone"],"publicObject":"/pnfs/fnal.gov/usr/uboone/persistent/stash/wcp_ups/wcp/releases/tag/v00_10_00/input_data_files/XGB_nue_seed2_0923.xml","organizationUrl":"https://fnal.gov/","repositoryUrl":{"url":"https://microboone.fnal.gov/"}},"purdue":{"description":"General namespace for Purdue University OSStore contribution.\n","organization":"Purdue University","dataVisibility":"public","size":null,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"Purdue University OSStore","namespace":["/purdue"],"thirtyDayReads":0,"oneYearReads":0,"organizationUrl":"https://purdue.edu/","repositoryUrl":null},"routeviews":{"description":"The RouteViews dataset provides a map of the Internet, as seen by participating\nsites. The information, collected from the [BGP](https://en.wikipedia.org/wiki/Border_Gateway_Protocol)\ntables of routers, includes both current and historic \"snapshots\". This allows\noperators of major Internet services to detect changes to the map in near-real\ntime and for researchers to understand the historical evolution of the Internet.\n\nThe RouteViews dataset is funded by University of Oregon's\n[Advanced Network Technology Center](https://web.archive.org/web/20200428083158/http://antc.uoregon.edu/),\nand by grants from the [National Science Foundation](https://www.nsf.gov/),\n[Cisco Systems](https://www.cisco.com/), the [Defense Advanced Research Projects Agency](https://www.darpa.mil/),\n[Juniper Networks](https://www.juniper.net/), Sprint Advanced Technology Laboratories,\n[Catchpoint](https://catchpoint.com/) and the providers who graciously provide their BGP views.\n","organization":"University of Oregon","organizationUrl":"https://www.uoregon.edu/","repositoryUrl":{"url":"https://www.routeviews.org/routeviews/"},"fieldOfScience":"Computer Systems Networking and Telecommunications","numberOfDatasets":1,"dataVisibility":"public","publicObject":"/routeviews/chicago/route-views.chicago/bgpdata/2025.03/RIBS/rib.20250319.0400.bz2","size":113627027734528,"display":true,"rank":1,"inProgress":false,"thirtyDayReads":252838517,"oneYearReads":252838638,"name":"RouteViews","namespace":["/routeviews"]},"sage-backup":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/sage-backup"],"thirtyDayReads":0,"oneYearReads":71571001350,"organizationUrl":null},"sage":{"description":"The Sage project provides a platform for AI computing at the edge. It operates\na nationwide infrastructure of distributed sensors - from urban landscapes to remote\nmountainsides - that collect, process using AI techniques, and aggregate data.\n\nWith over 100 Sage nodes deployed across 17 states, including fire-prone regions in\nthe Western U.S., the platform supports rapid-response science and sustained observation\nof ecological systems, agriculture, urban environments, and weather-related hazards.\n\nSage uploads its data into NSF CC* funded storage systems connected to the OSDF. Data\naccess requires a Sage account; more information can be found in [the Sage documentation](https://sagecontinuum.org/docs/tutorials/accessing-data) and tutorials.\n","organization":"Northwestern University","dataVisibility":"private","size":116631128783045,"fieldOfScience":"Computer and Information Sciences and Support Services","numberOfDatasets":null,"rank":1,"inProgress":false,"display":true,"name":"Sage AI at the Edge","namespace":["/sage"],"organizationUrl":"https://www.northwestern.edu/","repositoryUrl":{"url":"https://sagecontinuum.org/"},"thirtyDayReads":null,"oneYearReads":null},"spin4d":{"description":"The [SPIn4D project](https://ifauh.github.io/SPIN4D/) (Spectropolarimetric Inversion in Four Dimensions with Deep Learning)\ndevelops neural networks to help prepare for the huge amount of solar data coming\nfrom the NSF-funded [Inouye Solar Telescope](https://nso.edu/telescopes/inouye-solar-telescope/),\nthe most powerful solar telescope in the world.\n\nSPIn4D's [data release one](http://dtn-itc.ifa.hawaii.edu/spin4d/DR1/) is 109TB of simulated small-scale dynamo\nactions accompanying the project's first paper. A [corresponding Jupyter notebook](https://github.com/ifauh/spin4d-data/blob/main/spin4d-data-exploration.ipynb)\nillustrates how to access and use the data via the OSDF using the [Pelican](https://pelicanplatform.org) clients.\nThe dataset is also [accessible](https://www.linkedin.com/pulse/ndp-action-astronomical-data-size-national-data-platform-lfwnc/)\nvia the [National Data Platform](https://nationaldataplatform.org/).\n\nFor more information, see the [accompanying spotlight article](https://pelicanplatform.org/news/2024/12/20/sun-secrets).\n","organization":"University of Hawaii-Moana","dataVisibility":"public","size":119846767427584,"objectCount":null,"fieldOfScience":"Physical Sciences","numberOfDatasets":6,"rank":2,"inProgress":false,"display":true,"name":"SPIN4D Data Release 1","namespace":["/uhkoa/SPIN4D-DR1"],"publicObject":"/uhkoa/SPIN4D-DR1/SPIN4D_SSD_50G_V/subdomain_9.054681","organizationUrl":"https://manoa.hawaii.edu/","repositoryUrl":{"url":"http://dtn-itc.ifa.hawaii.edu/spin4d/DR1/"}},"ucsd-physics":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/ucsd/physics"],"thirtyDayReads":12836138539,"oneYearReads":2005778190351,"publicObject":null,"organizationUrl":null},"uhkoa":{"description":"The KoaStore repository is a high performance and scalable parallel file system\nstorage solution that can be used by University of Hawai'i faculty and staff. KoaStore\nwas [funded](https://www.hawaii.edu/news/2022/10/21/500k-boosts-data-intensive-research/)\nthrough the [NSF Campus Cyberinfrastructure]() program through [award #2232862](https://www.nsf.gov/awardsearch/showAward?AWD_ID=2232862).\n\nKoaStore users provide datasets such as [SPIN4D](https://ifauh.github.io/SPIN4D/) accessible\nvia the OSDF.\n","organization":"University of Hawai'i","dataVisibility":"public","size":null,"fieldOfScience":"Multi/Interdisciplinary Studies","numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":"University of Hawai'i KoaStore","namespace":["/uhkoa"],"thirtyDayReads":0,"oneYearReads":1669909818762,"publicObject":null,"organizationUrl":"https://www.hawaii.edu/","repositoryUrl":{"url":"https://datascience.hawaii.edu/koa-research-storage-service/"}},"user-ligo":{"description":"","organization":null,"dataVisibility":null,"size":null,"bytesXferd":null,"url":null,"fieldOfScience":null,"numberOfDatasets":null,"rank":0,"inProgress":false,"display":false,"name":null,"namespace":["/user/ligo"],"thirtyDayReads":26757304214,"oneYearReads":149357052994448,"organizationUrl":null}}

GlueX is an experiment at the Thomas Jefferson National Accelerator Facility (JLab) in Newport News, Virginia, that studies how particles called mesons behave to learn more about the strong force—the force that holds atomic nuclei together. The dataset from GlueX comes from millions of collisions between high-energy photons and protons. GlueX uses the OSDF to distribute inputs to its data simulations and is exploring using OSDF for reprocessing.

GlueX is supported by the US Department of Energy.

Experiments related to the Virtual Data Collaboratory at the Scientific Computing and Imaging Institute at the University of Utah.

These cyberinfrastructure experiments include activities like running automated workflows on the OSPool triggered on alerts from the EarthScope Consortium.

AWS Open Data hosts publicly accessible datasets covering areas such as earth science, climate, genomics, machine learning, transportation, and economics. The collection includes contributions from a range of organizations, including government agencies, academic institutions, and private companies.

There are currently nearly 700 datasets, totaling over 100 petabytes of data.

Browse the full catalog at the Registry of Open Data on AWS.

The AWS Open Data datasets are publicly accessible and are integrated with the OSDF, allowing users to stage the data closer to nationally-funded computing resources via the OSDF’s hardware infrastructure. This enables fusion between AWS Open Data and other data sources accessible via the OSDF.

The Center for Applied Internet Data Analysis (CAIDA) runs an “Network Telescope”, collecting packets sent to a cross-section of the public Internet similarly to how a telescope collects stray light.

This dataset is made available to scientists attempting to understand how activity, such as malware, is moving across the Internet.

The CAIDA integration with OSDF aims to stage the most recent subset of the recorded data to be made available for large-scale analysis.

Staging data for CHTC collaborations with University of Wisconsin-Madison research groups. Currently serving data specifically for the Joao Dorea group.

The Center for High Throughput Computing (CHTC), established in 2006, aims to bring the power of High Throughput Computing to all fields of research, and to allow the future of HTC to be shaped by insight from all fields.

Beyond technologies and innovation and HTC through projects like HTCondor, the CHTC operates general purpose clusters for the UW-Madison campus. CHTC allows researchers to stage their research data to an object store connected to the OSDF and then process and analyze the data using the OSDF with on-campus resources or the OSPool.

This data is organized as “working datasets” representing running workloads, not permanent scientific outputs.

The Electron-Ion Collider is a proposed facility being built at the Brookhaven National Laboratory. Experiments at the facility include the ePIC detector. The computing for EIC is a joint collaboration with the Jefferson National Lab; the datasets connected to the OSDF include input files and other information necessary to help with simulations of the detector’s behavior.

The South Florida region is home to nearly 10 million people, and the population is growing. The region faces several challenges, such as rising sea levels and flooding, harmful algae blooms, water contamination, and wildlife habit loss, which affects the economy and the welfare of its population. Florida International University (FIU) runs the EnviStor project, which is a centrally managed, petabyte-scale storage system that is also a clearing house for supporting interdisciplinary research and modeling involving both built and natural environments in South Florida. EnviStor provides opportunities for students and faculty to enhance their knowledge of database management, focusing on interoperability.

The datasets kept in EnviStor can be accessed via the OSDF; work is ongoing to provide new computing workflows and AI-based dataset discovery that will help users utilize the data.

The EnviStor activity and underlying storage is funded through the NSF Campus Cyberinfrastructure program under Award # 2322308.

Simulation data used for the Einstein Telescope Mock Data Challenge.

The Einstein Telescope (ET) is a proposed next-generation gravitational wave observatory, aiming to detect gravitational waves with much higher sensitivity than either the LIGO or VIRGO instruments.

As part of the studies and the design proposal for the ET instrument, the mock data challenge is being run in 2024 and 2025 to better understand how the future data may be distributed and analyzed. An example tutorial for using the data can be found on GitHub.

This is a cool description.

Screeeeeeeeeam

The Fusion Data Platform (FDP) provides a modern, Python-based data framework for analyzing data from magnetic fusion experiments.

Using data from the DIII-D National Fusion Facility, users can leverage the FDP software to stream data via the OSDF services for their fusion data analysis.

The FDP is funded by the DOE under award DE-SC0024426.

Public gravitational wave data from international gravitational wave network, including data from LIGO, VIRGO, and KAGRA. This data can be used in the detection and study of black holes throughout the universe.

These datasets are the calibrated readouts from the corresponding interferometers. Also included are mirrors of data analysis products released to Zenodo to accompany publications.

The IceCube repository integrates data from the IceCube Neutrino Observatory, a cubic-kilometer detector embedded deep in Antarctic ice near the South Pole. IceCube records when high-energy neutrinos interact with the ice.

Using over 5,000 optical sensors deployed between 1,450 and 2,450 meters below the surface, the observatory captures detailed information about these events, including their timing, location, and intensity. The data is used to study cosmic neutrinos and the astrophysical phenomena that produce them, such as black holes, supernovae, and gamma-ray bursts.

The IceCube collaboration is supported by multiple funding agencies including the NSF. The dataset is maintained by the Wisconsin Icecube Particle Astrophysics Center.

User-managed data by members of the LIGO Scientific Collaboration, the Virgo Collaboration, and the KAGRA Collaboration. These data are created and used within individual users’ workflows as they analyze gravitational-wave data in order to detect black hole collisions and other cosmic phenomena. This origin is hosted at Caltech.

This data is not public; it is in support of in-progress computational workflows.

Gravitational wave data collected by the KAGRA interferometer, a scientific device for detecting gravitational waves in the Gifu prefecture in Japan. KAGRA collaborates closely with the LIGO detectors in the US to provide more accurate detection of gravitational waves

This is the data not yet released to the public.

Gravitational wave data collected by the LIGO interferometer detectors in Hanford, Washington and Livingston, Louisiana and hosted by LIGO Laboratory at Caltech. Gravitational wave data is used to detect black hole collisions and other cosmic phenomena and is one piece of the NSF’s multi-messenger astronomy initiatives.

This is the data not yet released to the public.

Curated datasets used by members of the LIGO Scientific Collaboration, the Virgo Collaboration, and the KAGRA Collaboration in the combined analysis of data collected from their detectors. These data consist of gravitational-wave data collected at any of the four interferometers but with simulated signals, as well as some other datasets, used for data analysis purposes in detecting black hole collisions and other cosmic phenomena as part of the NSF’s multi-messenger astronomy initiatives.

These data are not yet released to the public.

This is a test repository utilized by staff of the LIGO Laboratory at Caltech to test new versions ofthe Pelican software and configuration, to ensure that upcoming changes do not disrupt ongoing data analysis on any of the production origins. This test origin specifically tests the software and configuration of user-managed data analogous to that served in /igwn/cit.

This data is private.

This is a test namespace utilized by staff of the LIGO Laboratory at Caltech to test new versions of Pelican software and configuration, to ensure that upcoming changes do not disrupt ongoing data analysis on any of the production origins.

This data is private.

Gravitational wave data collected by the VIRGO interferometer, a scientific device for detecting gravitational waves near Pisa, Italy. VIRGO collaborates closely with the LIGO detectors in the US to provide more accurate detection of gravitational waves

This is the data not yet released to the public.

Jessica Kendall-Bar leads a research group that integrates engineering, data science, ecology, and visual storytelling/public communication to explore the behavior and physiology of marine life.

Her visual data work has appeared in various media platforms—from UC San Diego news to national outlets like The New York Times and The Atlantic—and has contributed to global policy efforts in areas such as marine mammal protection and coral reef recovery.

Jessica Kendall-Bar leads a research group that integrates engineering, data science, and ecology to explore the behavior and physiology of marine life. The data stored on the OSDF includes high-resolution multimodal data such as video, GPS, and electrophysiology.

The OSDF data is catalogued on the National Data Platform, enabling textual, conceptual, and map-based spatiotemporal search capabilities.

The NDP project is using this dataset as inputs for a data challenge planned for Fall 2025. It also powers an application running on the National Research Platform at https://lifeinthedeep.nrp-nautilus.io/.

The Jefferson National Laboratory (JLab) operates particle accelerator facilities and associated detectors for experiments like GlueX.

JLab connects its storage to the OSDF to allow large-scale data simulation and reprocessing on the PATh-operated OSPool resources and JLab-provided capacity.

This repository enables faculty and students at Kennesaw State University to use their NSF Campus Cyberinfrastructure (CC*) funded storage (Award #2430289) with their local HPC cluster via OSDF.

The Knight Lab uses and develops state-of-the-art computational and experimental techniques to ask fundamental questions about the evolution of the composition of biomolecules, genomes, and communities in different ecosystems, including the complex microbial ecosystems of the human body.

The MeerKAT Absorption Line Survey (MALS) consists of 1,655 hours of observatory time on the MeerKAT radio telescope at the South African Radio Astronomy Observatory. The survey aims to carry out the most sensitive search of HI and OH absorption lines at 0<z<2, the redshift range over which most of the cosmic evolution in the star formation rate density takes place.

The MALS dataset is replicated to the OSDF to allow collaborators at the NRAO participate in the scientific study of the data.

General namespace for University of Missouri OSStore contribution.

Research in machine learning methods like deep learning neural networks, computer vision and morphological neural networks.

The LLC4320 ocean dataset is the product of a 14-month simulation of ocean circulation and dynamics using the Massachusetts Institute of Technology’s General Circulation Model on a lat-lon-cap grid. Comprising extensive scalar data such as temperature, salinity, heat flux, radiation, and velocity, the dataset exceeds 4 PB and can potentially improve our understanding of global ocean circulation and its role in Earth’s climate system.

In order to make this dataset more accessible and easier to visualize, the National Science Data Fabric has processed the raw data into the ViSUS data format using their OpenViSUS toolsuite.

It will be used in the 2026 IEEE SciVis Contest to demonstrate cutting-edge technologies for working with petascale climate data provided by NASA.

The NASA C1440-LLC2160 dataset is the simulation output from research into coupling two models: a global atmospheric model and a global ocean model that were originally designed to be run separately. The atmospheric model is a C1440 configuration of the Goddard Earth Observing System (GEOS) atmospheric model running on a cubed-sphere grid. The global ocean model is an LLC2160 configuration of the MITgcm model that uses a latlon-cap grid. Each model was run for over 10000 hourly timesteps covering over 14 simulation months. With more than 10000 time steps and multiple scalar fields, it totals approximately 1.8 PB.

In order to make this dataset more accessible and easier to visualize, the National Science Data Fabric has processed the raw data into the ViSUS data format using their OpenViSUS toolsuite.

It will be used in the 2026 IEEE SciVis Contest to demonstrate cutting-edge technologies for working with petascale climate data provided by NASA.

NCAR provides a wide range of atmospheric and Earth system science datasets, including observational data from airborne and ground-based instruments, outputs from community weather models, and large-scale reanalysis and simulation data. These datasets support research on weather patterns, the water cycle, and extreme weather events. They are used by researchers, educators, and policymakers across the US.

Integrated with the OSDF is NCAR’s Research Data Archive (RDA), the centrally managed archive of the laboratory’s atmospheric and Earth system datasets. When downloading data from the web interface, users are automatically redirected to the OSDF cyberinfrastructure.

Example notebooks that analyze data from these datasets can be found in the NCAR OSDF Examples repository and are part of the NCAR effort to utilize the OSDF.

Visualizations

Visualization of climate data over South America on October 10, 2020, using NCAR datasets.

Climate over South America on October 10, 2020


Visualization of ocean temperature on January 16, 2014.

Ocean heat on January 16, 2014

The integration between NCAR and OSDF is part of the Pathfinders collaboration, a collaboration between five initiatives aimed at developing science-led pathways through the NSF cyberinfrastructure landscape. This work is funded by NSF award 1852977.

A century of suppressing wildfires has created a dangerous accumulation of flammable vegetation on landscapes, contributing to megafires that risk human life and destroy ecosystems. Prescribed burns can dramatically reduce the risk of large fires that are uncontrollable by decreasing this buildup of fuels. BurnPro3D is a science-driven, decision-support platform to help the fire management community understand risks and tradeoffs quickly and accurately when planning and conducting prescribed burns.

A century of suppressing wildfires has created a dangerous accumulation of flammable vegetation on landscapes, contributing to megafires that risk human life and destroy ecosystems. Prescribed burns can dramatically reduce the risk of large fires that are uncontrollable by decreasing this buildup of fuels. BurnPro3D is a science-driven, decision-support platform to help the fire management community understand risks and tradeoffs quickly and accurately when planning and conducting prescribed burns.

NOAA collects and uses active acoustic (or sonar) data for a variety of mapping requirements. Water column sonar data focus on the area from near the surface of the ocean to the seafloor. Primary uses of these specific sonar data include 3-D mapping of fish schools and other mid-water marine organisms; assessing biological abundance; species identification; and habitat characterization. Other uses include mapping underwater gas seeps and remotely monitoring undersea oil spills. NCEI archives water column sonar data collected by NOAA line offices, academia, industry, and international institutions.

Radio astronomy data from the Very Large Array Sky Survey (VLASS).

As written in the VLASS homepage, VLASS is a survey of the universe through the use of the Very Large Array (VLA) in New Mexico. The VLA is one of the most sensitive telescopes in the radio band that can provide more sensitive images of the universe than any other radio telescope in the world. This, however, requires processing large volumes of data and super-computer class computing resources. The VLASS is designed to produce a large collection of radio data available to wide range of scientists within the astronomical community. VLASS’s science goal is to produce a radio, all-sky survey that will benefit the entire astronomical community. As VLASS completes its three scans of the sky separated by approximately 32 months, new developments in data processing techniques will allow scientists an opportunity to download data instantly on potentially millions of astronomical radio sources.

The data in this data origin consists of interferometric visibilities stored in (Measurement Set (MS)) format. Each dataset contains calibrated visibilities for one of the sixteen spectral windows of the VLA and covers an area of 4 square degrees (2 degrees x 2 degrees) in the sky. All sixteen spectral windows are combined to generate a single image, so that the data contained in this data origin can be used to make images of approximately 70 regions in the sky, each image covering 4 square degrees. The LibRA software package is used to transform visibilities to images. The architecture and design considerations for LibRA are shown in this presentation.

Teams of scientists at the National Radio Astronomy Observatory (NRAO), Socorro, NM and the Center for High Throughput Computing (CHTC) have used the PATh and NRP facilities of the OSG to make the deepest image in the radio band of the Hubble Ultra-deep Field (HUDF). Similarly, the COSMOS HI Large Extra Galactic Survey (CHILES)[http://chiles.astro.columbia.edu/] project has 1000 hr of integration with the VLA on the COSMOS field. Imaging the CHILES data using PATh and NRP facilities delivered the deepest radio image of this region of the sky, at an unmatched data processing throughput. Similarly to the VLASS data stored in this data origin, the data for HUDF and CHILES is stored in the PATh facility data origin. These recent large scale imaging achievements that were made possible through use of OSG resources are reported in this [NRAO Newsletter article] (https://science.nrao.edu/enews/17.3/index.shtml#deepimaging) and this press release.

Namespace used by Fabio for ongoing CheckMK testing of NRP caches

The XENON Dark Matter Project is a scientific collaboration organized around the XENONnT dark matter detector at the INFN Gran Sasso National Laboratory in Gran Sasso, Italy.

This repository is used to store data and simulations from the XENONnT experiment to aid in its computing workloads.

Scripps Institution of Oceanography scientists conduct fundamental research to understand and protect the planet, and investigate our oceans, Earth, and atmosphere to find solutions to our greatest environmental challenges.

Datasets for use in OSDF usage tutorials by Pelican Platform facilitation team.

This repository supports the education and workforce development mission of the Pelican Project.

Staging area for PATh-operated Access Points located at the University of Chicago.

The PATh project allows researcher teams to stage their research data to an object store connected to the OSDF and then process and analyze the data using the OSDF via the OSPool. Any US-based open science team can utilize the PATh services for distributed High Throughput Computing workflows.

This data is organized as “working datasets” representing running workloads, not permanent scientific outputs.

Staging area for PATh-operated Access Points located at the University of Wisconsin-Madison.

The PATh project allows researcher teams to stage their research data to an object store connected to the OSDF and then process and analyze the data using the OSDF via the OSPool. Any US-based open science team can utilize the PATh services for distributed High Throughput Computing workflows.

This data is organized as “working datasets” representing running workloads, not permanent scientific outputs.

Staging area for PATh-operated collaboration services located at the University of Chicago.

The PATh project allows multi-institutional collaborations to stage their experimental data and simulation outputs to an object store connected to the OSDF and then process and analyze the data using the OSDF via the OSPool or other capacity dedicated to their experiment.

This data is organized as “working datasets” representing running workloads, not permanent scientific outputs.

Data staging area for OSPool projects with public data

Staging area for data used in the PATh Facility. The PATh Facility is a distributed computing resource spanning 5 sites, from San Diego, California to Syracuse, New York, that provides NSF-funded researches with compute credits for High Throughput Computing workflows.

This repository enables these NSF projects to stage their research data outputs to an object store connected to the OSDF and then process and analyze the data using the OSDF via both the PATh Facility computing hardware and the OSPool.

This data is organized as “working datasets” representing active workloads from researchers, not permanent scientific outputs.

Special projects data in the PATh facility.

To avoid redundancy, focus on /path-facility/data instead.

A namespace for the Pelican Platform facilitation team to use for a variety of facilitation purposes.

Testing and Validation Origin

The Dark Energy Survey (DES) will probe the origin of the accelerating universe and help uncover the nature of dark energy by measuring the 14-billion-year history of cosmic expansion with high precision. A 570M-pix camera, the DECam, is being built for this project and comprehensive tests were successfully accomplished at Fermilab’s telescope simulator (pictured above). As we countdown to DECam’s first light, workload and excitement increase among our collaborators. Starting in late 2011 and continuing for five years, DES will survey a large swath of the southern sky out to vast distances in order to provide new clues to this most fundamental of questions.

DES uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

The Deep Underground Neutrino Experiment is an international flagship experiment to unlock the mysteries of neutrinos. DUNE scientists will paint a clearer picture of the universe and how it works. Their research may even give us the key to understanding why we live in a matter-dominated universe — in other words, why we are here at all.

DUNE will pursue three major science goals: find out whether neutrinos could be the reason the universe is made of matter; look for subatomic phenomena that could help realize Einstein’s dream of the unification of forces; and watch for neutrinos emerging from an exploding star, perhaps witnessing the birth of a neutron star or a black hole.

DUNE uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

The ICARUS neutrino detector measures 65 feet long and weighs 760 tons. It began its life in Gran Sasso Laboratory in Italy, seeking out elusive particles using pioneering technology. It later spent two years undergoing upgrades at CERN, the European particle physics laboratory and home of the Large Hadron Collider. It moved to Fermilab in 2017 and was installed in its detector hall in 2018, where along with the new Cosmic Ray Tagger it forms the far detector for the Short-Baseline Neutrino program.

The ICARUS collaboration is investigating signs of physics that may point to a new kind of neutrino called the sterile neutrino. Other experiments have made measurements that suggest a departure from the standard three-neutrino model. ICARUS is also investigating the various probabilities of a neutrino interacting with different types of matter as well as neutrino-related astrophysics topics.

ICARUS uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

MINERvA (Main Injector Neutrino ExpeRiment to study v-A interactions) is the first neutrino experiment in the world to use a high-intensity beam to study neutrino reactions with five different nuclei, creating the first self-contained comparison of interactions in different elements. While this type of study has previously been done using beams of electrons, this is a first for neutrinos.

MINERvA is providing the world’s best, high-precision measurements of neutrino interactions on various nuclei, in the 1-to 10-GeV energy range. MINERvA’s results are being used as inputs to current and future experiments seeking to study neutrino oscillations, or the ability of neutrinos to change their type.

MINERvA uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

The NOvA (NuMI Off-axis ve Appearance) experiment is shedding light on one of nature’s most elusive particles: neutrinos. Since the late 1990s, physicists have known that neutrinos exhibit a quantum mechanical behavior called oscillations. But this behavior is not predicted by the Standard Model of particle physics. NOvA is working to better understand these strange particles through precision measurements of their oscillation properties.

NOvA uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

The international Short-Baseline Neutrino Program at Fermilab examines the properties of neutrinos, specifically how the flavor of a neutrino changes as it moves through space and matter. The program emerged from a joint proposal, submitted by three scientific collaborations, to use particle detectors to perform sensitive searches for ve appearance and νμ disappearance in the Booster Neutrino Beam. All of the detectors are types of liquid-argon time projection chambers, and each contributes to the development of this particle detection technology for the long-baseline Deep Underground Neutrino Experiments (DUNE).

SBN uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

The Short-Baseline Near Detector (SBND) is a 112-ton active mass liquid argon time projection chamber (LArTPC) neutrino detector that sits only 110-m from the target of the Booster Neutrino Beam (BNB) at Fermilab. SBND is the near detector in the Short-Baseline Neutrino Program. ICARUS is the far detector in the program, and MicroBooNE ran previously in the same beam.

SBND will record over a million neutrino interactions per year. By providing such a high statistics measurement of the un-oscillated content of the BNB, SBND plays a critical role in performing searches for neutrino oscillations at the SBN Program. The large data sample will also allow studies of neutrino-argon interactions in the GeV energy range with unprecedented precision. The physics of these interactions is an important element of future neutrino experiments that will employ the LArTPC technology, such as the long-baseline Deep Underground Neutrino Experiment, DUNE.

SBND uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

MicroBooNE is a large 170-ton liquid-argon time projection chamber (LArTPC) neutrino experiment located on the Booster neutrino beamline at Fermilab. The experiment first started collecting neutrino data in October 2015.

MicroBooNE investigates the low energy excess events observed by the MiniBooNE experiment, measure a suite of low energy neutrino cross sections, and investigate astro-particle physics.

MicroBooNE uses the OSDF to deliver common data inputs for large-scale simulation jobs distributed across the US.

General namespace for Purdue University OSStore contribution.

The RouteViews dataset provides a map of the Internet, as seen by participating sites. The information, collected from the BGP tables of routers, includes both current and historic “snapshots”. This allows operators of major Internet services to detect changes to the map in near-real time and for researchers to understand the historical evolution of the Internet.

The RouteViews dataset is funded by University of Oregon’s Advanced Network Technology Center, and by grants from the National Science Foundation, Cisco Systems, the Defense Advanced Research Projects Agency, Juniper Networks, Sprint Advanced Technology Laboratories, Catchpoint and the providers who graciously provide their BGP views.

The Sage project provides a platform for AI computing at the edge. It operates a nationwide infrastructure of distributed sensors - from urban landscapes to remote mountainsides - that collect, process using AI techniques, and aggregate data.

With over 100 Sage nodes deployed across 17 states, including fire-prone regions in the Western U.S., the platform supports rapid-response science and sustained observation of ecological systems, agriculture, urban environments, and weather-related hazards.

Sage uploads its data into NSF CC* funded storage systems connected to the OSDF. Data access requires a Sage account; more information can be found in the Sage documentation and tutorials.

The SPIn4D project (Spectropolarimetric Inversion in Four Dimensions with Deep Learning) develops neural networks to help prepare for the huge amount of solar data coming from the NSF-funded Inouye Solar Telescope, the most powerful solar telescope in the world.

SPIn4D’s data release one is 109TB of simulated small-scale dynamo actions accompanying the project’s first paper. A corresponding Jupyter notebook illustrates how to access and use the data via the OSDF using the Pelican clients. The dataset is also accessible via the National Data Platform.

For more information, see the accompanying spotlight article.

The KoaStore repository is a high performance and scalable parallel file system storage solution that can be used by University of Hawai’i faculty and staff. KoaStore was funded through the NSF Campus Cyberinfrastructure program through award #2232862.

KoaStore users provide datasets such as SPIN4D accessible via the OSDF.