A
API
Stands for Application Programming Interface. It is a set of functions that allow to access to data from an application or an operating system.
Stands for Application Programming Interface. It is a set of functions that allow to access to data from an application or an operating system.
Artificial intelligence
Field of science that refers to computer systems that perform tasks typically performed by human intelligence, such as speech recognition, problem-solving, or decision-making.
Field of science that refers to computer systems that perform tasks typically performed by human intelligence, such as speech recognition, problem-solving, or decision-making.
B
Big Data
Term that refers to large and complex datasets from various sources that can't be analysed by traditional tools.
Term that refers to large and complex datasets from various sources that can't be analysed by traditional tools.
C
CSV
A file format in which the records are arranged in a table, with each item of information separated by a comma.
A file format in which the records are arranged in a table, with each item of information separated by a comma.
D
Data Fabric
Architectural approach that integrates and manages access to data from multiple sources within the organisation.
Architectural approach that integrates and manages access to data from multiple sources within the organisation.
Data Hub
Centralised platform that provides a single point of access to data from multiple sources and simplifies collaboration and information sharing.
Centralised platform that provides a single point of access to data from multiple sources and simplifies collaboration and information sharing.
Data Integration
The process of transforming data from multiple sources to have a consolidated view for analysis.
The process of transforming data from multiple sources to have a consolidated view for analysis.
Data Lake
Data storage that holds large amounts of unstructured, structured or semi-structured raw data. It is typically used for machine learning processing or big data analysis.
Data storage that holds large amounts of unstructured, structured or semi-structured raw data. It is typically used for machine learning processing or big data analysis.
Data Mesh
Decentralised architectural approach that organises data within domains according to its use.
Decentralised architectural approach that organises data within domains according to its use.
Data Mining
Process of analysing large amounts of data raws from different angles to identify potential connections between databases.
Process of analysing large amounts of data raws from different angles to identify potential connections between databases.
Data Set
Structured collection of data records, generally organised in a tabular form with rows and columns.
Structured collection of data records, generally organised in a tabular form with rows and columns.
Data Warehouse
Data storage that contains structured data records designed to support business decision-making through data analysis with a consolidated view.
Data storage that contains structured data records designed to support business decision-making through data analysis with a consolidated view.
Deep Learning
Subset of Machine Learning that incorporates the use of neural networks inspired by the human brain to learn from large amounts of data.
Subset of Machine Learning that incorporates the use of neural networks inspired by the human brain to learn from large amounts of data.
E
ELT
Stands for Extraction Load Transform .It is a data integration process that loads data after extraction before transforming it directly into the data storage
Stands for Extraction Load Transform .It is a data integration process that loads data after extraction before transforming it directly into the data storage
ETL
Stands for Extraction Transform Load. It is a data integration process that transforms data after extraction before loading it into the data storage.
Stands for Extraction Transform Load. It is a data integration process that transforms data after extraction before loading it into the data storage.
F
Fine-Tuning
Process of programming a pre-trained model for specific tasks.
Process of programming a pre-trained model for specific tasks.
FTP
Protocol to transfer files between computer systems
Protocol to transfer files between computer systems
J
JSON
A file format in which records are organised as objects. It is inspired by the JavaScript programming language.
A file format in which records are organised as objects. It is inspired by the JavaScript programming language.
M
Machine Learning
The field of artificial intelligence that involves developing computer systems that can learn and adapt without explicit instruction.
The field of artificial intelligence that involves developing computer systems that can learn and adapt without explicit instruction.
Metadata
Data that describes data information, such as relational databases and details.
Data that describes data information, such as relational databases and details.
N
NLP
Stands for Neural Language Processing. It's an AI field that uses machine learning techniques to help computer systems understand natural language. It is particularly used in the context of generative AI.
Stands for Neural Language Processing. It's an AI field that uses machine learning techniques to help computer systems understand natural language. It is particularly used in the context of generative AI.
P
Python
Programming language generally used to perform analysis using algorithms.
Programming language generally used to perform analysis using algorithms.
R
Raw Data
Data that has not been manipulated and is in its native format.
Data that has not been manipulated and is in its native format.
S
Semi-Structured Data
Data with some organisational levels without a generally defined structure.
Data with some organisational levels without a generally defined structure.
SQL
Acronym of Structured Query Language. It is a programming language used to query, manipulate and interact with databases for analysis
Acronym of Structured Query Language. It is a programming language used to query, manipulate and interact with databases for analysis
Structured Data
Data that is organised and stored in a specific format to ease the readability for data analysis
Data that is organised and stored in a specific format to ease the readability for data analysis
U
Unstructured Data
Data that have not a predefined format and lacks a fixed schema, making it difficult to store in traditional relational databases
Data that have not a predefined format and lacks a fixed schema, making it difficult to store in traditional relational databases