2022/2023 KAN-CDSCV1008U Applied Machine Learning and Data Engineering in Business Context
English Title | |
Applied Machine Learning and Data Engineering in Business Context |
Course information |
|
Language | English |
Course ECTS | 7.5 ECTS |
Type | Elective |
Level | Full Degree Master |
Duration | One Semester |
Start time of the course | Autumn |
Timetable | Course schedule will be posted at calendar.cbs.dk |
Max. participants | 60 |
Study board |
Master of Science (MSc) in Business Administration and Data
Science
|
Course coordinator | |
|
|
Main academic disciplines | |
|
|
Teaching methods | |
|
|
Last updated on 10-02-2022 |
Relevant links |
Learning objectives | ||||||||||||||||||||||||
|
||||||||||||||||||||||||
Course prerequisites | ||||||||||||||||||||||||
The students are expected to have strong familiarity and a good understanding of Data Mining and Machine Learning concepts. It is expected that the students should have completed a proper machine learning course before taking this course. | ||||||||||||||||||||||||
Prerequisites for registering for the exam (activities during the teaching period) | ||||||||||||||||||||||||
Number of compulsory
activities which must be approved (see section 13 of the Programme
Regulations): 2
Compulsory home
assignments
Each student has to get 2 out of 3 home assignments approved in order to participate in the ordinary exam. All 3 mandatory activities are made in groups. In the first mandatory assignment, the students will download a dataset, choose a suitable algorithm/data analytical method to analyse the dataset and argue why the chosen algorithm/method is suitable for the purpose in a business-friendly manner (max 5 pages). In the second mandatory assignment, the students will develop an end-to-end machine-learning architecture for their data analytics project (from the first mandatory assignment) in the cloud using one of the Azure or AWS cloud infrastructures (max 5 pages). The third mandatory assignment is focussed on developing a 10-slide executive PowerPoint presentation that mainly targetted for communicating the machine-learning approach and data engineering architectures to the business audience and the management using the key principles of business communication. There will not be any extra attempts provided to the students before the ordinary exam. If a student cannot participate due to documented illness, or if a student does not get the activities approved in spite of making a real attempt, then the student will be given one extra attempt before the re-exam: one individual home assignment (10 pages) which will make up for two mandatory activities. |
||||||||||||||||||||||||
Examination | ||||||||||||||||||||||||
|
||||||||||||||||||||||||
Course content, structure and pedagogical approach | ||||||||||||||||||||||||
This course aims at making some of the complex mathematical, cloud and data science concepts tangible to the business audience. The primary focus of the course content is to explain data mining, machine learning theories and cloud & data engineering concepts in an animated and business-friendly fashion where students are motivated to understand deeply the assumptions of the models and evaluate their applicability for a given business context.
The course provides hands-on experience on how to use machine-learning methods for solving real-world problems in an organizational context using suitable cloud technologies and business communication practices. It simulates the real-world processes that are experienced by the data scientists in the companies to provide a hands-on experience to the students on data-driven decision-making in an organizational setting. This course is ideal for the students who have a strong technical background in machine-learning and looking forward to enriching their skills on data engineering, cloud technologies, and business communication to have a smooth transition into the data scientist/data engineer careers at a later point of time. This is course is offered in collaboration with Capgemini and other Danish companies and therefore, the students will have an opportunity to interact with domain experts from various industry sectors such as Finance, Marketing, Supply chain and Energy.
The course is structured in three parts, providing the students with a full overview of methods, techniques, and practices that are currently used in the industry with respect to data-driven decision-making in an organization setting as follows.
1. Machine Learning in Business
2.Data Engineering and Cloud Technologies (using Amazon Web Services and Microsoft Azure cloud platforms)
3.Business Communication Practices
|
||||||||||||||||||||||||
Description of the teaching methods | ||||||||||||||||||||||||
This course is a blended-learning course. Some of the lectures and exercises will be delivered online but there will be some activities, especially few lectures and hands-on exercise workshops will be conducted on campus. The hands-on exercises will be offered in Python/R programming languages. The cloud platforms such as Amazon Web Services and Microsoft Azure will be used for providing hands-on experience on end-to-end cloud architectures and automation of machine-learning processes. In addition to the above, there will be several presentations by the domain experts, data scientists and data engineers working in the Danish industry. | ||||||||||||||||||||||||
Feedback during the teaching period | ||||||||||||||||||||||||
In-class and hands-on exercises will be used
systematically to test students’ understanding of the course
content and increase their ability to reproduce acquired knowledge
and skills autonomously. Students will receive continuous in-class
feedback on them.
As part of the course, the students will have to take 3 mandatory assignments, out of which one will be a business presentation exercise. The students will receive feedback on these mandatory activities. Moreover, feedback on the hands-on exercises will be also provided in the classroom. |
||||||||||||||||||||||||
Student workload | ||||||||||||||||||||||||
|
||||||||||||||||||||||||
Expected literature | ||||||||||||||||||||||||
Due to the rapidly evolving nature of the field, the reading list will be updated and has to be consulted at the start of the semester. Students are advised to check the syllabus on Canvas before buying any material.
The teaching material will include
Some suggested textbooks for reference:
|