Modules of the study program
Each subject is part of a module.
Python for data analysis (5 ECTS credits)
- Syntax and data structures
- Data storage and manipulation
- NumPy, pandas, Matplotlib and Seaborn libraries
Databases (2 ECTS credits)
- Relational databases
- Entity-relationship model
- Data acquisition
- The Internet as a data source
- Information exchange
- Distributed storage
- Hadoop (HDFS + MapReduce)
- Real-time processing
- NoSQL databases
- Google Cloud Platform
- Cloud SQL
Visualisation (2 ECTS credits)
- General visualisation concepts
- Storytelling with data
- Commercial platforms for visualisation
Data collection techniques (2 ECTS credits)
- Data Management
- Master Data Management (MDM)
- Data extraction in business-like environments (SQL, Hive)
- Web scraping
- Social media
Big Data Techniques (3 ECTS credits)
- Computer architecture. Cloud Computing. Cloud Infrastructure. OCI
- Analytical SQL. Oracle Autonomous Database: ADW, ATP, JSON
- Big Data Cloud Products
Statistical data analysis (8 ECTS credits)
- Review of probability, random variables and hypothesis testing
- Multidimensional random variables. Joint density and mass. Conditional distributions. Covariance and correlation. Expectation of a random vector, variance and covariance matrix. Independence of random variables
- Analysis of variance
- Multiple linear regression and logistic regression
- Lasso and Ridge Regression
- Principal component analysis
- Time Series
- Classification models
- Clustering techniques
Data preparation and cleaning (2 ECTS credits)
- Exploratory data analysis
- Data pre-processing
- Noise and outlier detection
- Handling of missing values
- Handling of class imbalance
- Structured and unstructured data
- Evaluation of the distributions of variables
Machine Learning (6 ECTS credits)
- Introduction to Machine Learning
- Types of learning: supervised, unsupervised, semi-supervised and reinforcement learning.
- Frequentist vs. Bayesian models. Parametric vs. non-parametric models.
- Inference vs. prediction. Overfitting vs. underfitting. Bias vs. variance
- Data processing, missing values and imputation. Feature engineering, feature importance and explainability
- Markov chains, Naive Bayes and rules (sequence analysis and association analysis)
- Instance-based models (kNN), LDA, SVM, tree-based models (decision trees, bagged trees, random forests...) and regularization.
- Clustering techniques (k-means, hierarchical...) and dimensionality transformation (Isomap, t-SNE, SOM, SVD, PCA...)
- Network analysis: spectral clustering, node centrality, bipartite networks, co-citation, bibliographic coupling
- Survival analysis: censored and truncated data, Kaplan-Meier estimator, log-rank test...
- Ensemble learning methods (sequential and parallel ensemble techniques)
- Case studies: recommender system, time series and Datathon
Deep Learning (3 ECTS credits)
- Fundamentals of neural networks. Architectures, activation and loss functions, layers and optimization of hyper-parameters and models.
- Neural networks for tabular data: classification, regression and time series
- Natural language processing. Text classification and document clustering
- Image processing
- Transfer learning
- Reinforcement learning
- Cloud processing. Parallel training of a network and deployment of models as a service.
The Master's aims to provide solid technical training together with a business vision, so that graduates can act as a bridge between the executive and technical levels of a project. To that end, professionals from leading companies and multinationals teach practical, successful cases that apply the concepts acquired in the first two modules. The program also has the collaboration of IESE Business School, the business school of the University of Navarra.
Project management and business vision (5 ECTS credits)
- Project planning: identification, definition and objectives
- Agile Methodologies
- Privacy and transparency. Ethics of artificial intelligence
- Generative AI projects
Workshops with companies (4 ECTS credits)
Presentation of examples and use cases by experts from renowned companies in various sectors. The tools and techniques taught during the program are addressed through real, current projects.
The Master's Thesis (TFM) plays an important role in the program. It takes a practical approach that also provides solutions to real problems and projects proposed by companies with which there are partnership agreements. It can be co-directed by these companies and by academics from the University of Navarra, and is an excellent opportunity for students to lead the implementation of projects with an impact on their professional environment.
Master's Thesis (18 ECTS credits)
The TFM will consist of an original piece of work in which the competences acquired during the Master's Degree must be put into practice. It can be done in groups and carried out within a company or institution that proposes a project involving data collection, cleaning, preparation, advanced analytics and visualization of the results. It can also take the form of an entrepreneurship project in this field.
Ethical aspects of data processing, as well as the economic and social impact of the results, should be highlighted. Students must demonstrate that they know how to plan a project and carry it out in a real working environment, so that they gain highly practical experience in the field of Data Science and Big Data.
During the Master's Degree in Big Data Science you will work with today's most in-demand technologies and tools. You will use programming languages and software such as Python, Anaconda, Jupyter, RStudio, SQL, Git, PySpark, scikit-learn, Keras or TensorFlow, as well as the data visualization platforms Tableau or Power BI. Upon completion, you will be certified as an AWS Cloud Practitioner.