Wednesday, January 22, 2025
A- A A+

M. Sc. Data Science and Analytics

Objectives of the Program :

In the emerging data driven world, Data Science and Data Analytics are receiving much acceptance in the national and international levels for making wise decisions. Data analytics is  an essential field that brings together Data, technology, information, statistical analysis all in one platform. Every organization in private/ public sector creates a large volume of data from almost every area of daily activity. Analyzing such data has huge potential to predict the future of the organization. A good amount of knowledge is very necessary in the field of data management, machine learning, natural language processing as they are the key factors in Data Science. The decision to establish a School of Data Analytics along with the School of Mathematics & Statistics in the Mahatma Gandhi University, Kottayam during 2020-21 is a most appropriate decision and will help the research and career opportunities for prospective students.

In the proposed curriculum for M.Sc. Data Science and Analytics, apart from teaching core subjects, the students are also offered training in programming languages and open softwares such as R, Python, etc to handle real life problems through the practical classes. As part of the program the students are also given training in computing softwares like SPSS, SAS etc. The program is so designed that on successful completion, the students would be able to pursue research or higher studies in the areas of Data Science, Data Analytics, Statistics, Mathematics, Computer Science, Economics, Management and allied fields. Moreover, emerging areas like Bayesian Inference and Computing, Stochastic Modeling, Time Series Forecasting, Internet of Things and Fraud Analytics are included in the curriculum. There will be an open course during third semester to encourage interdisciplinary studies and research. This can be selected from among courses offered by any other school/department/centre in the university. The following are some of the emerging areas of applications in this respect.

  1. Internet of Things (IoT): Analytics tools and techniques for dealing with the massive amounts of structured and unstructured data generated by IoT will continue to gain importance.
  2. Finance and Banking: Creating newer business models or frameworks that leverages the available data to facilitate financial institutions to monetize data to deliver superior customer values.
  3. Data Security: Fraud Security Analytics are already transforming intrusion detection, differential privacy, digital watermarking and malware countermeasures.
  4. Health Care: Health Care Analytics and Predictive Analytics enable the examination of patterns in various healthcare data in order to determine how clinical care can be improved while limiting excessive spending.

There has been much interest in Bayesian Inference in Data Analytics in recent years. It is a way to get sharper predictions from the data, particularly when there is not much data available and when one want to juice every last bit of information from it. During the last three decades, Actuarial Science has gone through revolutionary changes due to the implementation of high speed computers and modern theory. It applies mathematical and statistical methods to assess risk in insurance, finance and other industries. Official Statistics make information on economic and social development accessible to the public, allowing the impact of government polices to be assessed and thus improving accountability.
With the emergence of new diseases like AIDS, SARS, COVID 19 etc. more interest has come to analyze such epidemic data to study the dynamics of its spread. In the modern data driven world Data Science and Data Analytics are the principal tools for making wise decisions and drawing valid conclusions. We expect students to select the elective courses to ensure employment and research opportunities in emerging areas. The main thrust of the program will be to enable students to understand basic concepts and applications through real data sets using computer programs like R and Python in the data analytics lab rather than just imparting theory. Industry interaction, case studies and projects will be main components of all courses.

  1. Eligibility for admissions:

B.Sc. Degree in Mathematics or Statistics or Computer Science main / B.Sc. (triple main) with Mathematics, Statistics and Computer Science as main subjects or BCA, B.Sc./B.Voc. Data Science or Data Analytics, B.E./ B. Tech.( CS or IT) with at least 50% marks (CGPA 5.0 out of 10.00 under grading system) for the optional subjects taken together, provided the candidate has studied at least 2 courses in Probability / Statistics at degree level.
Admission will be made through a common admission procedure (CAP) on the basis of a Common Admission Test (CAT) or a special test conducted for specific programmes by the Departments, as the case may be. Admission may be based on the written test alone or written test and interview or on the basis of the marks obtained in the qualifying examinations as well as the marks obtained in the written test, the interview as decided by the Faculty Council of Schools / Centres / Institutes from time to time in accordance with CSS regulations 2020.

3. Examination : Credit and Semester System (CSS)
4. Medium of instruction and assessment : English
5. Duration of the Course : 4 Semesters (2 years)

This is a regular program of study in which no private / distance mode will not be conducted. However, under extreme situations like COVID 19 pandemic, classes may be conducted online as per UGC guidelines and University regulations . The duration of PG program shall be 4 semesters. The duration of each semester shall be 90 working days (18-20 weeks) including internal and external examinations. Odd semesters are normally from July to December and even semesters from January to June. A student shall be permitted to register for the program at the time of admission. A student who registers for the course shall complete the course within a period of 4 years from the date of commencement of the program.

6. Courses and Credits:

Every Program conducted under Credit Semester System shall be monitored by the Departmental Academic Committee. In all the programmes, three kinds of courses are offered; Core Courses (3-4 credits), Elective Courses (2-4 credits) and Open elective courses (4 credits). Core courses are offered by the Schools/Department/Centre/Institute conducting the programme. Elective Courses shall be selected either from the same School/Department or from some other School / Centres / Institutes. Any course chosen by a student, generally from an unrelated discipline/subject, from Schools/ Centres / Institutes other than own School/Department/Center, with an intention to seek broad exposure, is called an Open course. Students are required to take one open course in the Third semester. The details are given in Table of Courses and Credits.

A Semester shall be worth a minimum of 16 credits and maximum of 24 credits. The total credits for a postgraduate program shall be between 80 to 88. The maximum credits for core courses, electives and open courses in postgraduate programmes shall be 64, 20 and 4 respectively. A minimum of 4 credits and maximum of 20 credits shall be set apart for the project work/dissertation. The compulsory project/dissertation to be completed in the 4th semester of a postgraduate programme shall be prepared by the student under the guidance of a member of the faculty or, in the case of subjects, which so demand, an external guide, to be decided by the school‘s faculty council. University Departments / Schools are permitted to offer online UGC approved MOOC courses from SWAYAM platform as electives during 3rd and 4th semesters subject to the condition that the aggregate credits for such online courses shall not exceed 20% of total credits. These shall be coordinated by a faculty coordinator subject to the approval of the faculty council.

7. Evaluation and Grading

The CSS rules and regulations 2020 for University Departments and Schools will be applicable. Evaluation scheme for each course shall contain 2 parts (a) Internal Continuous Assessment (CA) and (b) End Semester Assessment (SA). 40% weightage is given to Continuous internal Assessment and 60 % to End Semester external Assessment. Both internal and external evaluation is carried out in accordance with the grading system as given in the CSS regulations. Normally odd semester ESA is through examiners in the department. But for even semesters, external examiners will also be included in the board of examiners and there will be double valuation. Questions will be mainly of applied nature rather than just theory.

8. Faculty under which the Degree is awarded : Science

9. Specializations offered if any : List of Electives enclosed in item 13.

10. Note on compliance with UGC Minimum Standards for the conduct and award of Post Graduate Degrees:

Present syllabus is in compliance with UGC Minimum Standards to award Post Graduate Degree. It is ideal if one enjoys Mathematics and Statistics and would like to use his skills to model future events and risk.

11. Objectives and Outcomes of the Program:

Objectives:


(i) To provide advanced level teaching and training in theory and applications of Data Science and Analytics as well as skills in computer programming and data analytics and interpretation.(ii) To provide a platform for talented students to become leaders in this discipline by undergoing higher studies in the subject as well as to train them to suit for the needs of the society as job providers by establishing startups and business.
(iii) To allow more flexibility to branch out into other emerging areas of Statistics, Computer Science, Data Science and Data Analytics.
(iv) To draw together a variety of subject areas to enable students to model real-world data from various contexts by exploring a blend of Applied Mathematics and Statistics with appropriate computing tools including free softwares like R, Python etc.
(v) To provide special attention to interdisciplinary areas of research and applications in describing, exploring, analyzing and comparing data with an innovative research mind in a data driven world.
(vi) To make them familiar with emerging developments in Data Science and Big Data Analytics and their applications in various areas.
(vii) To provide the students on the job training in industrial applications and professional development with a view to enable them to get opportunities for teaching, research and employment in India and abroad through industry academia collaborations and linkages with reputed research institutes and industries in India and abroad.

Outcomes:


(i) After undergoing this program, students will get advanced knowledge in theory and applications in all areas of Data Science, Data Analytics, Statistical Learning, Machine Learning, Data Base Management, Artificial Intelligence, etc.

(ii) Students have secured practical skills in statistical methods and computer programming to plan and execute projects and decision making using Data Science, Data Analytics, Machine Learning etc.

(iii) Students are well equipped to undertake any work involving exploratory data analysis, fraud analytics, data learning, text mining etc. as future entrepreneurs.

(iv) Students have developed skills in advanced computing softwares like R and Python for big data analytics, computing and data analysis.

(v) Students are well trained to take up jobs in reputed firms and MNCs etc as Data Scientists, Data Analysts, Data Engineers, Risk Analysts, Business Analysts, Financial Analysts, Decision Makers, Entrepreneurs etc.

(vi) Students are motivated to pursue teaching and research in all emerging areas of research in theoretical and applied branches of Data Science, Data Analytics and related areas.

  1. The Program Structure: M. Sc. Data Science & Analytics
Table of Courses and Credits

 

Course Code Course Title Teaching

 

L+T+P

Credits
SEMESTER I Total Credits 21    
SDA MP C01 INTRODUCTION TO DATA SCIENCE & DATA ANALYTICS 3 +0+2 3
SDA MP C02 LINEAR ALGEBRA AND MATRICES 3 +0+2 3
SDA MP C03 PROBABILITY AND DISTRIBUTION THEORY 3 +0+2 3
SDA MP C04 DATA BASE MANAGEMENT SYSTEMS 3+0+2 3
SDA MP C05 PROGRAMMING IN PYTHON FOR DATA SCIENCE 3+0+2 3
SDA MP C06 STATISTICAL INFERENCE 3 +0+2 3
SDA MP C07 PRACTICAL – DATA SCIENCE LAB   3

SEMESTER II

Total Credits 24

   
SDA MP C08 SAMPLING METHODS & DESIGN OF EXPERIMENTS 3+0+ 2 3
SDA MP C09 APPLIED MULTIVARIATE ANALYSIS 3 +0+2 3
SDA MP C10 APPLIED REGRESSION ANALYSIS 3 +0+2 3
SDA MP C11 PROGRAMMING IN R FOR DATA ANALYTICS 3 +0+2 3
SDA MP C12 DATA VISUALIZATION AND PRESENTATION 3+0+2 3
SDA MP E01 ELECTIVE 1 3 +0+2 3
SDA MP E02 ELECTIVE 2 3+0+ 2 3
SDA MP C13 PRACTICAL – DATA ANALYTICS LAB   3

SEMESTER III

Total Credits 24

   
SDA MP C14 MACHINE LEARNING 3+0+2 3
SDA MP C15 STOCHASTIC MODELING & TIME SERIES FORECASTING 3+0+2 3
SDA MP C16 ARTIFICIAL INTELLIGENCE 3 +0+2 3
SDA MP E03 ELECTIVE 3 4 +0+2 4
SDA MP E04 ELECTIVE 4 4 +0+2 4
SDA MP C17 PRACTICAL – COMPUTER LAB   3
  OPEN COURSE 4 +0+0 4

SEMESTER IV

Total Credits 16

   
SDA MP E05 ELECTIVE 5 3 +0+2 3
SDA MP C18 PROJECT / DISSERTATION 20 9
SDA MP C19 PROJECT PRESENTATION & VIVA VOCE   4
 

Grand Total of Credits

 

85

Thus in the present Program Structure there are 19 Core Courses with a total of 64 credits, 5 Elective courses with a total of 17 credits and an Open Course of 4 credits so that the grand total of credits is 85 for the whole program.

 

13. Table of Elective Courses:
Course Code Course Title Teaching hours

 

L+T+P

Credits
E1 NEURAL NETWORKS AND DEEP LEARNING 3+0+2 3
E2 BAYESIAN INFERENCE & COMPUTING 3+0+2 3
E3 BIG DATA AND HADOOP 3+0+2 4
E4 INFORMATION RETRIEVAL TECHNIQUES 3+0+2 3
E5 DATA WAREHOUSING & DATA MINING 3+0+2 3
E6 WEB SCRAPING AND TEXT MINING 3+0+2 3
E7 NATURAL LANGUAGE PROCESSING WITH PYTHON 3+0+2 4
E8 FRAUD ANALYTICS 3+0+2 3
E9 INTERNET OF THINGS IN THE CLOUD 3+0+2 3
E10 OPERATIONS RESEARCH 3+0+2 4
E11 CLOUD COMPUTING 3+0+2 3
E12 BUSINESS INTELLIGENCE & ANALYTICS 3+0+2 4
E13 DATA ANALYTICS COMPUTING 3+0+2 3
E14 COMPLEX NETWORK ANALYSIS 3+0+2 4

N.B. 1 : Open Course is any course offered by another Department / School / Inter University Centre of this University other than the parent Department / School / Centre. Students can select one open course subject to permission from both Departments / Schools to encourage interdisciplinary studies and research in emerging areas.

N.B. 2: In case students wish to undergo online MOOC Courses in SWAYAM PORTAL or offered by IITs and other reputed institutes of national / international importance, they can choose them as electives during Semester 3 and 4 with the permission of the Head of the Department / Director of the School in accordance with the CSS regulations of the university.

N.B. 3: During Semester IV a Major Project / Dissertation shall be carried out in a reputed research institute / department or industry or software company under the joint supervision of an internal faculty and external guide / expert approved by the Head of the Dept / Director of the School. (This can be started from the end of the second semester with initial discussions and literature collection and review.) Each student has to submit a bound copy and soft copy of the Project Report (documented in LaTex) of about 50 to 75 pages certified by the supervisors, at least 7 days before the conduct of the Viva Voce and Project Presentation.

N.B. 4: Along with each course necessary lab training and demonstration of applications of all topics is to be provided in the computer lab using case studies and real data sets as part of practical and records are to be prepared by the students and submitted before lab examinations to the Faculty in charge / Head of the Department / School. Case study reports and presentations and paper publications shall be part of internal assessment.

The M.Sc. Data Science and Analytics program is designed to facilitate learners with a statistics / computing / technology background who wish to become experts in this new  emerging area of Data Analytics  and Data Science. It will also be of interest to learners who have completed their undergraduate degree and wish to specialize in this area. The program combine case studies with statistical software tools, training students to make business analytics decisions backed by facts and data.

Working with data effectively to derive insights is a necessary skill for every individual who would like to make sound decisions. It is widely applied in all areas like financial analysis, disease tracking, criminal investigation, health diagnostics, space research, geo-spatial remote sensing, weather forecasting, business analytics, market research, policy analysis etc. Data Science, Data Analytics, etc.  are emerging areas of applied research also.

Various Workshops, Training Programs, National/ International Seminars and Conferences will be organized for the benefit of faculty  and students. MoUs and Linkages will be signed with reputed industries and research institutes for providing opportunities for collaboration and on-the-job training in emerging areas.

One full semester Project Work and Internship are part of the program and will be carried out in reputed industries / national level institutes / universities etc. for providing wide exposure in interdisciplinary as well as applied research and on the job training.  In the present data driven era of increasing thrust on interdisciplinary applications of Data Science and Data Analytics, there is wide scope and employment  potential for postgraduate and research level interdisciplinary programs in these disciplines. The rules regarding admission and evaluation are guided by CSS Regulations 2020 approved by M.G.University and subsequent amendments.

 The total number of seats for the M.Sc. Program is 20 for students from Kerala State, 4 additional for All India Quota(outside Kerala State) and 4 additional for International Quota, 1 additional for Arts/Sports Quota.

Eligibility for Admission: B.Sc. Degree in Mathematics or Statistics  or Computer Science main / B.Sc. (triple main) with Mathematics, Statistics and Computer Science as main subjects or BCA, B.Sc. / B.Voc. Data Science or Data Analytics, B.E./ B. Tech.( CS or IT) with at least 50% marks (CGPA 5.0 out of 10.00 under grading system) for Part III, provided the candidate has studied at least 2 courses in Probability / Statistics at degree level.

Admission Criteria: Admission will be solely on the basis of a rank list prepared as per the score obtained in the Common Admission Test CAT (80%) and  interview (20%).

Compulsory test under CAT: Test Syllabus: Undergraduate level Computer Science (25%), Mathematics (25%), Probability & Statistics (25%), Logical Reasoning & General Mental Ability (25%).

 

Revised Syllabus