Study program of the Master's Degree in Big Data Science. University of Navarra - University Master's Degree in Big Data Science

Modules of the Study program

Each subject is part of a module.

CALENDAR 25-26 (PDF)

Module I. Programming and Computing

Python for Data Analysis (5 ECTS credits)

Syntax and data structures
Data storage and manipulation
NumPy, pandas, Matplotlib and Seaborn libraries
Projects

Databases (1.5 ECTS credits)

Relational databases
- Entity-relationship model
- Normalization
- SQL
Data acquisition
- OLAP
- Internet as a data source
- Exchange of information
Distributed storage
- Blockchain
- Hadoop (HDFS + MapReduce)
- Real-time processing
NoSQL databases
- Types
- MongoDB
Google Cloud Platform
- Compute
- Cloud SQL
- BigTable
- DataStore
- BigQuery

Data Visualization (2 ECTS credits)

General visualization concepts
Storytelling with data
Commercial platforms for visualization

Data Collection Techniques (1.5 ECTS credits)

Data Management
- Master Data Management (MDM)
- Data extraction in business-like environments (SQL, Hive)
Web scraping
Images
Social networks

Big Data Techniques (4 ECTS credits)

Computer architecture. Cloud Computing. Cloud Infrastructure. OCI
Analytical SQL. Oracle Autonomous Database: ADW, ATP, JSON
Big Data Cloud Products
AWS Certification - Cloud Practitioner

Module II. Data Analysis

Statistical Data Analysis (8 ECTS credits)

Review of probability, random variables and hypothesis testing
Multidimensional random variables. Joint density and mass. Conditional distributions. Covariance and correlation. Expectation of a random vector, variance and covariance matrix. Independence of random variables
Analysis of variance
Multiple linear regression and logistic regression
Lasso and Ridge Regression
Principal component analysis
Time Series
Classification models
Clustering techniques

Data Preparation and Cleaning (2 ECTS credits)

Exploratory data analysis
Pre-processing of data
Noise and outlier detection
Processing of missing values
Handling of imbalanced data
Structured and unstructured data
Evaluation of the distributions of variables

Machine Learning (5 ECTS credits)

Introduction to Machine Learning
Types of learning: supervised, unsupervised, semi-supervised and reinforcement learning.
Frequentist vs. Bayesian models. Parametric vs. non-parametric models.
Inference vs. prediction. Overfitting vs. underfitting. Bias vs. variance
Data processing, missing values and imputation. Feature engineering, feature importance and explainability
Markov chains, Naive Bayes and rules (sequence analysis and association analysis)
Instance-based models (kNN), LDA, SVM, tree-based models (decision tree, bagging trees, random forest...) and regularization.
Clustering techniques (k-means, hierarchical...) and dimensional transformation (Isomap, t-SNE, SOM, SVD, PCA...)
Network analysis: spectral clustering, node centrality, bipartite network, co-citation, bibliographic coupling
Survival analysis: censored and truncated data, Kaplan-Meier estimator, log-rank test...
Ensemble learning methods (sequential and parallel ensemble techniques)
MLOps
Case studies: recommender system, time series and Datathon

Deep Learning (4 ECTS credits)

Fundamentals of neural networks. Architectures, activation and loss functions, layers and optimization of hyper-parameters and models.
Neural networks for tabular data: classification, regression and time series
Natural language processing. Text classification and document clustering
Image processing
Transfer learning
Reinforcement learning
Cloud processing. Parallel training of a network and deployment of models as a service.

Throughout the subject, you will deploy and integrate Generative AI models adapted to your applications and projects. You will use their full potential to produce personalized content, customize LLMs with your own data, and learn advanced prompt engineering techniques for writing effective prompts.

Module III. Projects

The Master's program aims to provide solid technical training together with a business vision, so that graduates can act as a bridge between the executive and technical levels of a project. To that end, professionals from leading companies and multinationals teach practical, successful cases that apply the concepts acquired in the first two modules. The program also has the collaboration of IESE Business School, the business school of the University of Navarra.

Project Management and Business Vision (8 ECTS credits)

Project planning: identification, definition and objectives
Agile Methodologies
Privacy and transparency. Ethics of artificial intelligence
Generative AI projects
Applications

Workshops with companies (1 ECTS credit)

Presentation of examples and use cases by experts from renowned companies in various sectors. The tools and techniques taught during the program are addressed through real, current projects.

Module IV. Master's Thesis

The Master's Thesis plays an important role in the program. It takes a practical approach that provides solutions to real problems and projects proposed by companies with which there are partnership agreements. It can be co-directed by these companies and by academics from the University of Navarra, and is an excellent opportunity for students to lead the implementation of projects with an impact on their professional environment.

Master's Thesis (18 ECTS credits)

The Master's Thesis (TFM) will consist of an original piece of work in which the competences acquired during the Master's Degree must be put into practice. It can be done in groups and developed within a company or institution that proposes a project involving the collection, cleaning, preparation and advanced analytics of data and the visualization of the results. It can also take the form of an entrepreneurship project in this field.

Ethical aspects of data processing, as well as the economic and social impact of the results, should be highlighted. Students must demonstrate that they know how to plan a project and carry it out in a real working environment, thereby acquiring highly practical experience in Data Science and Big Data.

During the University Master's Degree in Big Data Science you will work with today's most in-demand technologies and tools. You will use programming languages and software such as Python, Anaconda, Jupyter, RStudio, SQL, Databricks, Git, PySpark, scikit-learn, Keras and TensorFlow, as well as the data visualization platforms Tableau and Power BI. Upon completion, you will be certified as an AWS Cloud Practitioner.

In addition, you will deploy and integrate Generative AI models adapted to your applications and projects. You will use their full potential to produce personalized content, customize LLMs with your own data, and learn advanced prompt engineering techniques.
