Accelerated
Provide access to hardware-based accelerators such as Graphics Processing Units (GPUs) or Field Programmable Gate Arrays (FPGAs). [Source: AWS]
Analytics
Big data analytics is the often complex process of examining large and varied data sets, or big data, to uncover information -- such as hidden patterns, unknown correlations, market trends and customer preferences -- that can help organizations make informed business decisions. [Source: TechTarget]
Analytics Services
Work with data faster to extract value from your data. [Source: AWS]
Application Performance Management
Application Services
Application services (often used instead of application management services or application services management) are a pool of services such as load balancing, application performance monitoring, application acceleration, autoscaling, micro‑segmentation, service proxy and service discovery needed to optimally deploy, run and improve applications. [Source: Avi]
Authentication and Authorization
Authentication is a key mechanism for information security that establishes proof of identity to gain access to information in the system. Authorization is an important identity service to avoid unauthorized access to cloud resources. [Source: Pratiba, et al]
Auto-generated Models
Automated machine learning (AutoML) is the process of automating the process of applying machine learning to real-world problems. AutoML covers the complete pipeline from the raw dataset to the deployable machine learning model. AutoML was proposed as an artificial intelligence-based solution to the ever-growing challenge of applying machine learning. [Source: Wikipedia]
Batch Data Processing
Batch data processing is an efficient way of processing high volumes of data, where a group of transactions is collected over a period of time. Data is collected, entered, processed and then the batch results are produced (Hadoop is focused on batch data processing). Batch processing requires separate programs for input, process and output. [Source: Data Science Central]
Beam
A multi-cloud governance service that provides cost , accounting, optimization, and security capabilities for CloudBank.
Big Data & Analytics
Big data is a combination of structured, semistructured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling and other advanced analytics applications. [Source: TechTarget] Big data analytics is the often complex process of examining large and varied data sets, or big data, to uncover information -- such as hidden patterns, unknown correlations, market trends and customer preferences -- that can help organizations make informed business decisions. [Source: TechTarget]
Billing Account
A
Block Storage
Block storage is an approach to data storage in which each storage volume acts as an individual hard drive that is configured by the storage administrator. In the block storage model, data is saved to the storage media in fixed-sized chunks called blocks. Each block is associated with a unique address, and the address is the only metadata assigned to each block. [Source: TechTarget]
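The fixed-size chunking and per-block addressing described above can be sketched in a few lines of Python. This is only an illustration: the names `write_blocks` and `read_blocks`, the 4-byte block size, and the use of a dictionary as the "volume" are assumptions made for the example, and real block devices typically pad the final block rather than leaving it short.

```python
BLOCK_SIZE = 4  # illustrative only; real devices use sizes such as 512 or 4096 bytes


def write_blocks(data, block_size=BLOCK_SIZE):
    """Split data into fixed-size chunks, each keyed only by its block address."""
    offsets = range(0, len(data), block_size)
    return {address: data[offset:offset + block_size]
            for address, offset in enumerate(offsets)}


def read_blocks(blocks):
    """Reassemble the original bytes by reading blocks in address order."""
    return b"".join(blocks[address] for address in sorted(blocks))


blocks = write_blocks(b"cloudbank!")
restored = read_blocks(blocks)
```

The address is the only thing stored alongside each chunk; anything like a file name or owner would have to live in a layer above the blocks.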
Browser Console
CB
CILogon
A federated identity management service that enables researchers to use their home organization identities to access research applications, rather than requiring yet another username and password to log on.
Cold Storage
Cold storage is a computer system or mode of operation designed for the retention of inactive data. [Source: TechTarget]
Command Line Interface
Accessing Cloud resources, tools, and services from a command-line interface.
Compute Optimized
Compute optimized instances are ideal for compute-bound applications that benefit from high-performance processors. [Source: AWS]
Connectivity & Control Services
Secure, control, and manage your devices from the cloud. [Source: AWS]
Console & APIs
Accessing Cloud services from APIs and the console.
Containers
A container is a standard unit of software that packages up code and all its dependencies so the application runs quickly and reliably from one computing environment to another. [Source: Docker]
Content Delivery Network
A content delivery network, or content distribution network (CDN), is a geographically distributed network of proxy servers and their data centers. [Source: Wikipedia]
Conversational Interface
Data Sets
A data set is a collection of related, discrete items of related data that may be accessed individually or in combination or managed as a whole entity. [Source: TechTarget]
Database Services
A database can be defined as a shared collection of interrelated data designed to meet the varied information needs of an organisation. [Source: McFadden and Hoffer]
Dedicated Interconnect
Dedicated Interconnect provides direct physical connections between your on-premises network and a Cloud's network. [Source: Google]
Deep Cold Storage
Deep cold storage is a storage location for data that will probably not be accessed again, but must be kept in case of a compliance audit or some other business reason. [Source: TechTarget]
Deployment
Cloud deployment refers to the enablement of SaaS (software as a service), PaaS (platform as a service) or IaaS (infrastructure as a service) solutions that may be accessed on demand by end users or consumers. [Source: Atos]
Device
Specialized hardware for devices.
Device Software
Connect your devices and operate them at the edge. [Source: AWS]
Document Database
A document database is a type of nonrelational database that is designed to store and query data as JSON-like documents. [Source: Amazon]
Domain and DNS
The Domain Name System (DNS) is a hierarchical and decentralized naming system for computers, services, or other resources connected to the Internet or a private network. [Source: Wikipedia]
Extract, Transform, Load
Extract, transform, load (ETL) is the general procedure of copying data from one or more sources into a destination system which represents the data differently from the source(s) or in a different context than the source(s). [Source: Wikipedia]
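As a rough illustration of the three stages, the sketch below copies readings from a CSV source into a SQLite table that represents them differently (Celsius instead of Fahrenheit). The sample data, the table name `readings`, and the function names `extract`, `transform`, and `load` are all invented for this example; it uses only Python's standard library.

```python
import csv
import io
import sqlite3

# Hypothetical source data: temperatures recorded in Fahrenheit.
RAW_CSV = "name,temp_f\nlab-a,68\nlab-b,77\n"


def extract(text):
    """Extract: read rows out of the source format."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert each reading to Celsius, the destination's representation."""
    return [(row["name"], round((float(row["temp_f"]) - 32) * 5 / 9, 1))
            for row in rows]


def load(records, conn):
    """Load: write the transformed records into the destination table."""
    conn.execute("CREATE TABLE readings (name TEXT, temp_c REAL)")
    conn.executemany("INSERT INTO readings VALUES (?, ?)", records)
    conn.commit()


etl_conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), etl_conn)
loaded = etl_conn.execute(
    "SELECT name, temp_c FROM readings ORDER BY name"
).fetchall()
```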
File Storage
File storage, also called file-level or file-based storage, stores data in a hierarchical structure. [Source: TechTarget]
Flexible Shape
Free Notebook Hosting
No charge for notebook hosting.
Fully Managed
A fully managed service provides developers and data scientists with the ability to build, train, and deploy machine learning (ML) models quickly. [Source: AWS]
Fund
An allocation of dollars to a PI and one or more Co-PIs for a specific project or award to be spent on one or more Public Clouds. For NSF awardees, a CloudBank
Funder
An entity (e.g., agency or university) that banks money at CloudBank for use on Public Clouds. A
General Purpose
General purpose instances provide a balance of compute, memory, and networking resources, and can be used for a variety of workloads. [Source: AWS]
Graph Database
A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. [Source: Wikipedia]
HDD
A computer hard disk drive (HDD) is a non-volatile memory hardware device that controls the positioning, reading and writing of the hard disk, which furnishes data storage. [Source: TechTarget]
High Throughput Computing
tasks that are easily broken into parallel non-communicating jobs to be disseminated across multiple virtual machines.
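A small sketch of this pattern follows, with local worker threads standing in for the multiple virtual machines mentioned above. The job function `simulate` and its workload are invented for illustration. The key property is that each job depends only on its own input and never communicates with the others.

```python
from concurrent.futures import ThreadPoolExecutor


def simulate(seed):
    """One independent job: its result depends only on its own input."""
    return sum((seed * k) % 7 for k in range(1000))


# Each job could run anywhere, in any order; here four local workers
# stand in for separate virtual machines.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(simulate, range(8)))
```

Because the jobs share no state, adding more workers (or machines) needs no change to the job itself.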
Hot Storage
High durability, availability, and performance object storage for frequently accessed data. [Source: AWS]
IAAS
Infrastructure as a service (IaaS) is a form of cloud computing that provides virtualized computing resources over the internet. [Source: TechTarget]
IAM
Identity and Access Management (
In-Memory Database
An in-memory database is a type of nonrelational database that relies primarily on memory for data storage, in contrast to databases that store data on disk or SSDs. In-memory databases are designed to attain minimal response time by eliminating the need to access disks. [Source: Amazon]
Key-Value Database
A key-value database is a type of nonrelational database that uses a simple key-value method to store data. A key-value database stores data as a collection of key-value pairs in which a key serves as a unique identifier. [Source: Amazon]
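The key-value method can be pictured with a toy in-process store. The class `KeyValueStore` and its `put`, `get`, and `delete` methods are hypothetical names chosen for this sketch, not the API of any particular product.

```python
class KeyValueStore:
    """Toy key-value store: every value is reached only through its unique key."""

    def __init__(self):
        self._items = {}

    def put(self, key, value):
        # Writing to an existing key replaces the previous value.
        self._items[key] = value

    def get(self, key, default=None):
        return self._items.get(key, default)

    def delete(self, key):
        self._items.pop(key, None)


store = KeyValueStore()
store.put("user:42", {"name": "Ada"})
store.put("user:42", {"name": "Ada Lovelace"})  # same key, so the value is overwritten
```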
Ledger Database
A NoSQL database that provides an immutable, transparent, and cryptographically verifiable transaction log owned by a central authority. [Source: Medium]
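One common way to make a transaction log cryptographically verifiable is to chain entries together with hashes, so that altering any past entry invalidates every later one. The sketch below assumes that approach; the definition above does not say which mechanism a given ledger database uses, and the `Ledger` class is a hypothetical illustration built on Python's standard `hashlib` and `json` modules.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


def _digest(prev_hash, record):
    payload = json.dumps(record, sort_keys=True)
    return hashlib.sha256((prev_hash + payload).encode()).hexdigest()


class Ledger:
    """Append-only log in which each entry's hash covers the previous entry's hash."""

    def __init__(self):
        self.entries = []

    def append(self, record):
        prev_hash = self.entries[-1]["hash"] if self.entries else GENESIS
        self.entries.append(
            {"record": record, "prev": prev_hash, "hash": _digest(prev_hash, record)}
        )

    def verify(self):
        """Recompute every hash; any edit to an earlier record breaks the chain."""
        prev_hash = GENESIS
        for entry in self.entries:
            if entry["prev"] != prev_hash:
                return False
            if entry["hash"] != _digest(prev_hash, entry["record"]):
                return False
            prev_hash = entry["hash"]
        return True


ledger = Ledger()
ledger.append({"from": "a", "to": "b", "amount": 5})
ledger.append({"from": "b", "to": "c", "amount": 2})
valid_before = ledger.verify()
ledger.entries[0]["record"]["amount"] = 500  # tamper with history
valid_after = ledger.verify()
```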
Load Balancer
Load balancing is defined as the methodical and efficient distribution of network or application traffic across multiple servers in a server farm. Each load balancer sits between client devices and backend servers, receiving and then distributing incoming requests to any available server capable of fulfilling them. [Source: Citrix]
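Round-robin is one simple distribution policy among many (others weigh server load or health). The sketch below assumes round-robin and assumes every server is always available; the class `RoundRobinBalancer` and the server names are invented for the example.

```python
from itertools import cycle


class RoundRobinBalancer:
    """Hands each incoming request to the next server in a fixed rotation."""

    def __init__(self, servers):
        if not servers:
            raise ValueError("at least one backend server is required")
        self._rotation = cycle(servers)

    def route(self, request):
        # Return the chosen server together with the request it will handle.
        return next(self._rotation), request


balancer = RoundRobinBalancer(["server-a", "server-b", "server-c"])
assignments = [balancer.route(f"request-{i}")[0] for i in range(5)]
```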
Machine Learning
Machine learning (ML) is the scientific study of algorithms and statistical models that computer systems use to perform a specific task without using explicit instructions, relying on patterns and inference instead. It is seen as a subset of artificial intelligence. [Source: Wikipedia]
Managed Batch Computing
Batch processing is defined as the processing of a finite amount of data without interaction or interruption. [Source: Spring.io]
Management Services
Services to help you manage your cloud resources.
Marketplace
A cloud marketplace provides customers with access to software applications and services that are built on, integrate with or complement the cloud provider's offerings. A marketplace typically provides customers with native cloud applications and approved apps created by third-party developers. [Source: TechTarget]
Memory Optimized
Memory optimized instances are designed to deliver fast performance for workloads that process large data sets in memory. [Source: AWS]
Messaging
Messaging is the exchange of messages (specially-formatted data describing events, requests, and replies) to a messaging server, which acts as a message exchange program for client programs. [Source: TechTarget]
Monitoring
Cloud monitoring is the process of evaluating, monitoring, and managing cloud-based services, applications, and infrastructure. [Source: Stackify]
Natural Language Processing
Natural language processing, usually shortened as NLP, is a branch of artificial intelligence that deals with the interaction between computers and humans using the natural language. [Source: Medium]
Network
Networking is the interconnection of multiple devices, generally termed as Hosts connected using multiple paths for the purpose of sending/receiving data or media. [Source: GeeksforGeeks]
Notebook Hosting
The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more. [Source: Jupyter]
Object Storage
Object storage, also called object-based storage, is an approach to addressing and manipulating data storage as discrete units, called objects. Objects are kept inside a single repository, and are not nested as files inside a folder inside other folders. [Source: TechTarget]
PAAS
Platform as a service (PaaS) is a cloud computing model in which a third-party provider delivers hardware and software tools -- usually those needed for application development -- to users over the internet. [Source: TechTarget]
Programming APIs
Accessing Cloud resources, services, and tools from programming APIs.
Quantum Computing
Relational Database
A relational database is a type of database that stores and provides access to data points that are related to one another. Relational databases are based on the relational model, an intuitive, straightforward way of representing data in tables. [Source: Oracle]
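To make "data points that are related to one another" concrete, the sketch below uses Python's built-in `sqlite3` module with two made-up tables, `projects` and `datasets`, linked by a `project_id` column. The table names and rows are assumptions for illustration only.

```python
import sqlite3

rel_conn = sqlite3.connect(":memory:")
rel_conn.executescript(
    """
    CREATE TABLE projects (id INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE datasets (
        id INTEGER PRIMARY KEY,
        project_id INTEGER REFERENCES projects(id),
        name TEXT
    );
    INSERT INTO projects VALUES (1, 'climate'), (2, 'genomics');
    INSERT INTO datasets VALUES (1, 1, 'rainfall'), (2, 1, 'wind'), (3, 2, 'reads');
    """
)

# The join follows the relationship between the two tables.
rows = rel_conn.execute(
    """
    SELECT p.title, d.name
    FROM projects AS p
    JOIN datasets AS d ON d.project_id = p.id
    ORDER BY p.title, d.name
    """
).fetchall()
```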
Serverless
Serverless computing is a cloud computing execution model in which the cloud provider runs the server, and dynamically manages the allocation of machine resources. Pricing is based on the actual amount of resources consumed by an application, rather than on pre-purchased units of capacity. [Source: Wikipedia]
Speech
Speech recognition is an interdisciplinary subfield of computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). [Source: Wikipedia]
Spot pricing
Cloud consumers bid on spare resources and employ them whenever the bid exceeds the current spot price, while the employed service will be interrupted when the spot price exceeds the current bid. [Source: Li, et al]
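The bid rule can be sketched as a comparison per pricing period. The function name `running_periods` and the prices are invented for this example, and treating a bid exactly equal to the spot price as "still running" is an assumption, since the definition above does not cover the tie case.

```python
def running_periods(bid, spot_prices):
    """For each period, True if the instance runs, False if it is interrupted.

    Assumption: a bid equal to the spot price keeps the instance running.
    """
    return [bid >= price for price in spot_prices]


# Hypothetical hourly spot prices in dollars, against a bid of $0.050.
prices = [0.030, 0.045, 0.060, 0.040]
states = running_periods(0.050, prices)
```

In the third period the spot price rises above the bid, so the workload is interrupted and resumes only once the price falls back.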
Storage
Cloud storage is a service model in which data is transmitted and stored on remote storage systems, where it is maintained, managed, backed up and made available to users over a network (typically the internet). [Source: TechTarget]
Storage Optimized
Storage optimized instances are designed for workloads that require high, sequential read and write access to very large data sets on local storage. They are optimized to deliver tens of thousands of low-latency, random I/O operations per second (IOPS) to applications. [Source: AWS]
Stream Data Ingest
Data ingestion is the transportation of data from assorted sources to a medium where it can be accessed, used, and analyzed by an organization. The destination is typically a data warehouse, data mart, database, or a document store. [Source: Stitch]
Stream Data Processing
Stream processing is a big data technology. It is used to query a continuous data stream and detect conditions quickly, within a small time period from the time of receiving the data. The detection time period varies from a few milliseconds to minutes. [Source: Medium]
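A minimal sketch of detecting a condition on a continuous stream follows: readings are handled one at a time as they arrive, and an alert is raised when the mean of the most recent few exceeds a threshold. The function `detect_spikes`, the window size, the threshold, and the sample readings are all assumptions made for illustration.

```python
from collections import deque


def detect_spikes(readings, window=3, threshold=100.0):
    """Yield (index, mean) whenever the mean of the last `window` readings exceeds `threshold`."""
    recent = deque(maxlen=window)  # keeps only the newest `window` values
    for index, value in enumerate(readings):
        recent.append(value)
        if len(recent) == window:
            mean = sum(recent) / window
            if mean > threshold:
                yield index, mean


alerts = list(detect_spikes([90, 95, 100, 120, 130, 80]))
```

Because the function is a generator, it never needs the whole stream in memory, only the current window.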
Time Series Database
A time series database (TSDB) is a software system that is optimized for storing and serving time series through associated pairs of time(s) and value(s). [Source: Wikipedia]
Translation
Machine translation (MT) is a subfield of computational linguistics that is focused on translating text from one language to another. [Source: Medium]
Video Intelligence
Video content analysis (also video content analytics, VCA) is the capability of automatically analyzing video to detect and determine temporal and spatial events. [Source: Wikipedia]
Virtual Networks
A virtual network connects virtual machines and devices, no matter their location, using software. In a physical network, layer 2 and 3 functions of the OSI model happen within physical switches and routers. [Source: VMware]
Vision
Computer vision, often abbreviated as CV, is defined as a field of study that seeks to develop techniques to help computers “see” and understand the content of digital images such as photographs and videos. [Source: Machine Learning Mastery]
Warm Storage
For data that is accessed less frequently, but requires rapid access when needed [Source: AWS]
Workflow Orchestration
Workflow orchestration empowers you to author, schedule, and monitor pipelines that span across clouds and on-premises data centers. [Source: Azure]