Past Projects

The American Red Cross Safe and Well Website is a service provided by the American Red Cross (ARC) for use by victims of disasters in the US, and their loved ones. For individuals affected by a disaster, this website provides a way to register themselves as “safe and well.” From a list of standard messages, the individual can select those that they want to communicate to their family members... more

Analytical WorkBench for Social Media Analytics (AWESOME) is a polystore-based system to support social data analytics. The AWESOME polystore can support relational, semistructured, graph and text data and houses a Spark computation engine. A salient feature of the system is that it is designed for handling cross-model derived data that are produced in the course of the analysis process.

Be There San Diego is a coalition of patients, communities, healthcare systems, and others working together to reduce and prevent heart attacks and strokes in our community. More than 70% of heart attacks and strokes can be prevented through healthy lifestyles and appropriate treatment. Be There San Diego is preventing heart attacks and strokes through a multi-faceted education and... more

With “big data” becoming a major force of innovation across enterprises of all sizes, new platforms for managing big data sets are being announced with some regularity, with increasingly more features.

The objective of this project is to develop an end-to-end application-layer benchmark for big data applications to enable ranking of big data systems according to a well-defined,... more

The Big Data Top 100 List initiative is an open community-based effort for benchmarking big data systems.

The objective is to develop an end-to-end application-layer benchmark for big data applications to enable ranking of big data systems according to a well-defined, verifiable/audited performance metric, with an accompanying efficiency metric.

With “big data” becoming a major... more

The California 3D project will process a California state-wide 3D orthophotography dataset and prepare it for visualization on advanced displays in Calit2’s Immersive Visualization Lab, including the StarCAVE and HIPerSpace systems. The ability to view the entire state in 3D, including changes over time, provides a paradigm shift for a number of Calit2’s applications in the Environment thrust... more

The objective of the CloudStor project is to explore new strategies and technologies for data-intensive cloud computing; investigate application profiles that benefit from this paradigm; and develop corresponding applications. The CloudStor group is interested in evaluating the performance and price/performance of alternative, dynamic strategies for provisioning data-intensive... more

The CONNECT Innovation Report provides an overview of the strength and impact of the innovation economy by tracking the health of the San Diego innovation economy. CONNECT provides a comparison of tech industry data in selected regions and monitors the availability of various types of capital. SDSC contributes to CONNECT by collecting and processing data from various resources for analysis... more

CyberGIS represents a new generation of GIS based on seamless synthesis of cyberinfrastructure, geographic information science, and spatial analysis and modeling. The NSF-funded CyberGIS project advances the science of CyberGIS, with a particular focus on enabling the analysis of big spatial datasets, computationally intensive spatial analysis and modeling, and collaborative geospatial problem... more

In this project, UCSD researchers from Calit2 CWPHS and the SDSC ACID group are working with cancer researchers at the M.D. Anderson Cancer Center in Houston, Texas to develop a comprehensive, state-of-the-art cyberplatform to enable large-scale and robust comparative effectiveness research across the neoplastic continuum, i.e., from cancer... more

Virtually all SDSC activities involve national and/or international collaborations and partnerships with individuals, communities and institutions both inside and outside SDSC.

The cyberinfrastructure provided by SDSC is a tremendous attractor for computational and data-oriented scientists. SDSC is currently a leader, co-leader, or participant in more than 80 grants and contracts.... more

DELPHI is a platform that enables integrated access and analysis of all data relevant to health. This platform promotes a more rapid development of empowering, data-driven health apps and tools by a broad community of health-related software developers.

The US Geoscience Information Network (GIN) is a system of state and federal geological survey online data providers and user applications linked together by a collection of shared web services and interchange formats for the purpose of finding, accessing, and using geoscientific information.

The objective of the GIN project is to develop standardized services to make data resources... more

GEON started in 2002 as a collaborative research project among a dozen PI institutions, funded by the NSF Information Technology Research (ITR) program, to develop cyberinfrastructure for Earth Science data sharing and integration. However, much of the core GEON cyberinfrastructure is generic and broadly applicable beyond Earth Sciences and Geosciences and, indeed, has been leveraged by many... more

The NEES Cyberinfrastructure Center (NEESit) is a service-focused organization created to deliver information technology tools and infrastructure to enable earthquake engineers to remotely participate in experiments, perform hybrid simulations, organize and share data, and collaborate with colleagues.


The broad goal of the I2T project was to develop XML-based data mediation technology for integrating geospatial and statistical data, which is of great interest to a number of statistical and other government agencies. The research participants included the San Diego Supercomputer Center and Computer Science Department at the University of California, San Diego, and the Inter-University... more

IDSE offers an education and training program covering a broad set of topics in Data Science through a curriculum that incorporates foundational topics as well as hands-on work with specific tools and technologies.

In 2004, the National Institute of General Medical Sciences (NIGMS) established the Modeling of Infectious Disease Agent Study as a collaborative network of research scientists who use computational, statistical and mathematical models to understand infectious disease dynamics and thereby help the nation prepare for, detect and respond to infectious disease threats.

ACID... more

Founded in 1978, the UCSD Moores Cancer Center is one of just 40 centers in the United States to hold a National Cancer Institute (NCI) designation as a Comprehensive Cancer Center. It is one of the leading institutions in the nation conducting basic and clinical cancer research, and in providing advanced patient care. The Cancer Center is one of five Organized Research Units (ORU) of the UCSD... more

The National Ecological Observatory Network (NEON) will collect data across the United States on the impacts of climate change, land use change and invasive species on natural resources and biodiversity. NEON is a project of the U.S. National Science Foundation, with many other U.S. agencies and NGOs cooperating.

NEON will be the first observatory network of its kind designed to detect... more

The National Laboratory for Advanced Data Research (NLADR) is a collaborative research and development activity in advanced data technologies between the San Diego Supercomputer Center at the University of California, San Diego, and the National Center for Supercomputing Applications (NCSA) at the University of Illinois.

NLADR's mission is to address the challenges facing research... more

The objective of the NMR Portal project is to develop a central location for the data collected by researchers and students using any of the NMR facilities at UCSD. Creating a central site for all of this data makes it easier for researchers to access their data at a later time after experiments are completed, and facilitates data sharing between PIs and their groups and among collaborating... more

Open Science Chain uses distributed ledger technology (a consortium blockchain) to securely store information about scientific data, including its provenance. This enables independent verification of the data's authenticity and helps establish trust in the research community.

Sher Dataspace is a cloud-based service to facilitate progressive structuring, curation, and on-going use of research data, primarily by the individual or small-group researcher.

The goal of the TEAM Network project is to scale up ecological studies to global proportions. This ambitious initiative is devoted to monitoring long-term trends in biodiversity by establishing networks of tropical field stations and standardized methods of data collection so that scientists anywhere on Earth can quantify the pace at which we are saving tropical ecosystems. In that... more

TeraGrid was an open scientific discovery infrastructure combining leadership class resources at eleven partner sites to create an integrated, persistent computational resource. Using high-performance network connections, TeraGrid integrated high-performance computers, data resources and tools, and high-end experimental facilities around the country.