Modulinformationssystem Informatik

 

Data Science URL PDF XML

Modulcode: infDaSci-01a
Englische Bezeichnung: Data Science
Modulverantwortliche(r): Prof. Dr. Matthias Renz
Turnus: jedes Jahr im WS (WS21/22 WS22/23 WS23/24 WS24/25)
Präsenzzeiten: 3V 1Ü
ECTS: 5
Workload: 45 h lectures, 15 h exercises, 90 h self studies
Dauer: ein Semester
Modulkategorien: BSc-Inf-A (BSc Inf (21)) BSc-WInf-WP-WInf (BSc WInf (21)) WI (BSc Inf (15)) 2F-MEd-Inf-WP (MEd-Hdl Inf (21)) 2F-MA-Inf-WP (2F-MA Inf (21)) MSc-WInf-WP-WInf (MSc WInf (21)) WI (MSc Inf (15)) WI (MSc WInf (15)) NF (Inf. als NF) INF-Math (Inf. als NF) INF-VWL (Inf. als NF) Arch-NFInf21 (Inf. als NF) EcoQuantFin (Export)
Lehrsprache: Englisch
Voraussetzungen: Info Inf-Math-A Inf-Math-B

Kurzfassung:

The lecture is intended to convey the basics for the presentation, processing and use of data to gain (new) knowledge and derive recommendations for action. The most important aspects of the life cycle of data are addressed, starting with data formats and structures, which play an important role in the collection and management of the data, through methods for processing and using the data, through to the representation and communication of the data knowledge and knowledge gained.

Lernziele:

The students

  • understand the term "data science" and its meaning (context)
  • know the common data formats, data structures and data models
  • know the most important models (from statistics) for describing data sets (data collections), their data quality and metadata.
  • know the most basic methods of data (pre) processing and basic introduction to methods for knowledge acquisition (machine learning, data mining, knowledge discovery, ...)
  • understand fundamental aspects of the interpretation and presentation of the results from the data processing
  • can apply the learned techniques to simple practical Data Science applications

Lehrinhalte:

  1. Data acquisition, data collection procedures and data models
  2. Statistical data description and data exploration (frequencies, graphical representation of data, description of distributions, concentration measures, univariate and multivariate data descriptors, box plots, correlation analysis, hypothesis test)
  3. Data cleaning (handling of missing / noisy data, interpolation, extrapolation, regression analysis, kriging, smoothing)
  4. Data integration and transformation (redundancy analysis, correlation analysis, chi-square test, smoothing, dimension reduction, feature extraction (spatial, temporal, multimedia), data cubes, index structures)
  5. Introduction to methods for searching in data (exact match, similarity search, kNN, ...)
  6. Introduction to methods for analyzing data (DM, ML, ect.)
  7. Data visualization (basics)

Weitere Voraussetzungen:

Prüfungsleistung:

Written exam

Prerequisits for the exam: home work

Lehr- und Lernmethoden:

In the lecture, the material is conveyed in different forms (blackboard, projector), which are selected depending on the respective content. For the most part of the lecture there are slides, which are made available together with other documents on the website of the event. Theoretical and practical tasks related to the subject matter taught in the lecture are dealt with in the exercises. The solutions are discussed.

Verwendbarkeit:

Students who did not pass this module as a mandatory module in their Bachelor, can choose this module as an elective module within their Bachelor or Master studies.

Literatur:

  1. Statistik: Der Weg zur Datenanalyse (Fahrmeir, Künstler, Pigeot, Lutz)
  2. Data Mining: Concepts and Techniques (Jiawei Han, Micheline Kamber)
  3. Interactive Data Visualization: Foundations, Techniques, and Applications (Ward, Grinstein, Keim)

Verweise:

Kommentar:

The lecture will start in winter term 21/22.