MC Which statement is CORRECT? The graph theoretic center is the node with the highest minimum distance to all other nodes. incorrect The closeness is always higher than the betweenness. incorrect The betweenness counts the number of the times that a node or edge occurs in the geodesics of the network. correct The geodesic represents the longest path between two nodes. incorrect MC To guarantee maximum independence and organizational impact of analytics, it is important that... the Chief Data Officer (CDO) or Chief Analytics Officer (CAO) reports to the CIO or CFO. incorrect analytics is supervised only locally in the business units. incorrect a Chief Data Officer or Chief Analytics officer is added to the executive committee who directly reports to the CEO. correct the CIO takes care of all analytical responsibilities. incorrect MC Given the following decision tree:



According to the decision tree, an applicant with Income > $50.000 and High Debt=Yes is classified as: Good Risk incorrect Bad Risk correct MC Which statement is CORRECT? All given success factors of an analytical model, i.e. relevance, performance, interpretability, efficiency, economical cost and regulatory compliance, are always equally important. incorrect Divisive hierarchical clustering starts from all observations in individual clusters and merges the ones that are most similar until all observations make up a single cluster. incorrect The betweenness counts the number of the times that a node or edge occurs in the geodesics of the network. correct When using on premise solutions, maintenance or upgrade projects may even go by unnoticed. incorrect MC Which statement is CORRECT? Data stewards can request data scientists to check or complete the value of a field. incorrect Data owners are the data quality experts who are in charge of assessing data quality by performing extensive and regular data quality checks. incorrect Quality of data is key to the success of any analytical exercise since it has a direct and measurable impact on the quality of the analytical model and hence its economic value. correct Data preprocessing activities such as handling missing values, duplicate data or outliers are preventive measures for dealing with data quality issues. incorrect MC Which of the following is not an advantage of open source software for analytics? It is available for free. incorrect It has been thoroughly engineered and extensively tested, validated and completely documented. correct A world-wide network of developers can work on it. incorrect It can be used in combination with commercial software. incorrect MC OLAP (On-Line Analytical Processing) can help in which of the following steps of the analytics process? Data transformation incorrect Data collection incorrect Data denormalizatio incorrect Data visualization correct MC Which statement is CORRECT? An important advantage of cloud based solutions concerns the scalability and economies of scale offered. More capacity (e.g. servers) can be added on the fly whenever needed. correct On premise solutions catalyze improved collaboration across business departments and geographical locations. incorrect When using on premise solutions, maintenance or upgrade projects may even go by unnoticed. incorrect The big footprint access to data management and analytics capabilities is a serious drawback of cloud based solutions. incorrect MC Consider a data set with a multiclass target variable as follows: 25% bad payers, 25% poor payers, 25% medium payers and 25% good payers. In this case, the entropy will be: Minimal incorrect Maximal correct MC What statement about the adjacency matrix representing a social network is not true? It is a symmetric matrix. incorrect It has the same number of rows and columns. incorrect It is sparse since it contains a lot of non-zero elements. correct It can include weights. incorrect