Ruta de navegación

Aplicaciones anidadas

Formación_Microcursos_GestionDatosR_ContenidoWebBasico_Titulo_TextoPrincipal

Data management with R

R is one of the most widely usedprogramming languages in scientific research, machine learning, data mining and financial mathematics. One of its strengths compared to other programming languages is its more than 2,000 libraries, which provide a wide range of highly developed statistical and graphical tools.

In this course we will explore some of these libraries to solve regression, time series, machine learning, etc. problems. As a preliminary step, the reading and writing of data files will be covered, as well as the manipulation, preparation and cleaning of data necessary to subsequently apply the corresponding algorithms.

Formacion_Microcursos_GestionDatosR_Desplegable_Info

The course is aimed at Degree and Master's students from the University of Navarra and other universities interested in approaching the area of Data Science with basic knowledge of computers and spreadsheets.

As this is an introductory course, no previous knowledge of programming in R or other software is required.

Installation of R and RStudio is required to follow the course. Preferably on Windows operating system.

The contents of the course are detailed below:

  • Introduction to R and RStudio

The first part of the course will be dedicated to basic aspects of R, such as the installation of the working environment, the creation of projects, the basics of programming in R and the handling of libraries.

  • Statistics with R

R is a language particularly suitable for statistical programming. For this reason, this part will be devoted to explaining the descriptive statistics tools it presents, as well as the most common statistical graphics (histograms, boxplots, etc.).
We will also study the possibilities that R has for the treatment of time series, focusing on ARIMA models for forecasting.

  • Reading, writing and manipulating data

Special emphasis will be put on reading and writing data in .csv and .xlsx files, data frame operations and data preparation and cleaning.

  • Introduction to MachineLearning with R

Simple machine learning algorithms will be introduced as examples. In particular, regression algorithms will be dealt with, using the libraries that R has developed for this purpose.

  • Advanced graphics with ggplot

This library gives us more flexibility than R base for the graphic part. The possibilities it offers will be explored and it will also be used to make interactive graphics.

  • Rmarkdown

As a complement to the rest of the course, there will be a short introduction to this tool that allows us to present the results or the code developed in R in a simple way.

The course is delivered synchronously online. It lasts 3 weeks, with theoretical and practical classes and tutorials lasting 2 hours.

The schedule course is as follows: Mondays, Wednesdays and Fridays from 14.00 to 16.00 h.

From 4 to 18 October, the following dates:

  • Session 1: Monday, 4 October

  • Session 2: Wednesday, 6 October

  • Tutorial: Friday, 8 October

  • Session 3: Monday, 11 October

  • Session 4: Wednesday, 13 October

  • Tutorial: Friday, 15 October

  • Session 5: Monday, 18 October

In the sessions, exercises will be proposed for students to work on their own, given that personal work is fundamental in programming. These exercises will then be solved in the tutorial sessions, where doubts about the exercises and the sessions will be resolved.

The evaluation will be carried out by means of 3 questionnaires subject test with questions on the topics covered in the sessions. The first will be on sessions 1 and 2, the second on sessions 3 and 4 and the third on session 5. You will have several days to answer each questionnaire and you will be able to ask your doubts about them in the tutorial sessions.

The final grade of the course will be calculated as the arithmetic average of the 3 questionnaires. At the end of the course, an accrediting certificate will be sent to those who have passed the course.

All explanation sessions will be conducted in the Zoom application and will be recorded and made available to the students.

grade: a minimum of attendance will be required to attend the sessions to be considered C.

The price of the course is 350€, with various discounts available:

  • University of Navarra students, 15% discount: 297,50€.

  • large family, 8% discount: 322€.

The steps to formalise enrollment in the course are as follows:

1.Fill in and send the pre-registration form form before Wednesday, 22 September 2021 at 14.00h. 

2. An email will be sent on Friday 24th Septemberinforming you whether or not you have been admitted to the course.

3. In the case of having been admitted, you will be provided with the link to the payment gateway in order to formalise the enrollment. It is necessary that the enrollment is formalised before Friday 1 October at 14.00h, otherwise your place will be sent to another candidate.

The delivery of the course will be subject to a minimum number of students.

Formacion_Microcursos_GestionDatosR_Enlaces_Flyer