Data Science and Machine Learning

Field Course (1060) … Why you should (not) take this class

Showcase

What you have to expect

Content

  • Coding in R, JavaScript (and Python)
  • Why still learn to code in the age of LLMs?

Webscraping

  • Extracting data from websites efficiently
  • Reading out static pages with Rvest
  • Control browser with RSelenium

Data Visualization

  • Principles of graphical integrity and excellence
  • The grammar of graphics

Machine Learning

  • Introduction into supervised learning
    • Decision tree learning
    • Bias-variance trade-off
    • Confusion matrix, ROC- and PR-curves
    • Cross validation
  • Neural networks
  • Natural language processing

Formalities

What you need to know

Field Course

  • Schedule and resources in index
  • Dates and rooms in VVZ
  • Grading
    • Assignments (40%)
      • Groups of four
      • In class presentations of solutions (random)
    • Exam (60%)
      • Pen and paper

Seminar

  • Kujtim Avdiu Fr 15:00-17:00
  • Practical implementations using R
  • Group project, presentation and active participation
  • Details in VVZ

Prerequisites

  • Coding skills in R or a similar language (e.g., Python, Julia)
  • Basic understanding of probability, statistics, linear algebra and calculus
  • Practical understanding of statistical modeling and experience in working with data