Data Science (DS)

This is an archived copy of the 2021-22 Catalog. To access the most recent version of the catalog, please visit http://catalog.jccc.edu/.

Courses

DS 210   Introduction to Data Science (3 Hours)

In this course, students receive an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, techniques and tools that data analysts and data scientists work with. This course provides a conceptual introduction to the ideas behind turning data into actionable knowledge and tools that will be used to analyze this data. The course will cover collecting, cleaning and sharing data. Additionally, this course will cover how to communicate results through visualizations.

DS 220   Data Visualization (3 Hours)

This course introduces students to key design principles and techniques for interactively visualizing data. In addition to understanding how visual representations are used in the analysis and understanding of complex data, students will acquire data visualization skills, including designing effective visualizations and creating interactive visualizations using various tools.

DS 230   SQL for Data Analysis (3 Hours)

In this course, students will focus on how to apply the Structured Query Language (SQL) to data analysis tasks. Spreadsheets will be used for the visualization of data. Additionally, basic statistics will be covered. All data will be extracted from relational tables.

DS 240   Introduction to Statistical Programming (3 Hours)

Students in this course will use a statistical programming language to perform effective data analysis. Students will acquire programming skills including reading data, accessing statistical packages, writing functions, debugging, profiling code, organizing code and commenting code.

DS 260   Data Mining* (3 Hours)

Prerequisites : DS 210 and DS 240.

This course will provide students with an understanding of fundamental data mining methodologies and the ability to formulate and solve problems with these methodologies. Particular attention will be paid to the process of extracting data, analyzing it from many dimensions or perspectives, then producing a summary of the information in a useful form that identifies relationships within the data. The lectures will be complemented with hands-on experience with data mining software to allow development of execution skills.

DS 270   Introduction to Machine Learning* (3 Hours)

Prerequisites : DS 210 and DS 240.

This introductory course gives an overview of machine learning concepts, techniques and algorithms. Supervised and unsupervised machine learning will be covered. Machine learning is an integral part of data analytics, which deals with developing data-driven insights for better designs and decisions and gives computers the ability to learn without being explicitly programmed.

DS 280   Big Data Architecture (3 Hours)

This course covers emerging big data architectures that deal with large amounts of unstructured and semi-structured data. This course is designed for developers who need to create applications to analyze big data stored in distributed file systems. Topics include file architecture, data retrieval, performance and data analysis.

DS 210

  • Title: Introduction to Data Science
  • Number: DS 210
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Description:

In this course, students receive an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, techniques and tools that data analysts and data scientists work with. This course provides a conceptual introduction to the ideas behind turning data into actionable knowledge and tools that will be used to analyze this data. The course will cover collecting, cleaning and sharing data. Additionally, this course will cover how to communicate results through visualizations.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe data science in the context of big data.
  2. Discuss the ethics of big data.
  3. Describe and apply the data science life cycle.
  4. Describe data.
  5. Describe the extract, transform and load (ETL) process and why it is important.
  6. Apply programming to the ETL process.
  7. Analyze real-world problems based on data analysis techniques.
  8. Explain the solutions.

Content Outline and Competencies:

I. Data Science and Big Data 

A. Describe big data. 

B. Discuss examples of data science. 

C. Discuss case studies. 

II. Ethics of Big Data 

A. Explain identity. 

B. Discuss privacy. 

C. Discuss ownership. 

D. Demonstrate reputation. 

III. Data Science Life Cycle 

A. Identify the problem. 

B. Identify available data sources. 

C. Prepare the data. 

D. Apply data analysis. 

E. Solve the problem. 

F. Explain the solutions. 

IV. Data 

A. Identify types of data. 

B. Demonstrate sources of data. 

C. Explain storage of data. 

D. Illustrate structured versus unstructured data. 

E. Discover messiness of data. 

V. ETL Process 

A. Extract data from data sources. 

B. Transform extracted data. 

C. Load transformed data. 

VI. ETL Programming 

A. Apply ETL basics. 

B. Employ data structures. 

C. Use programming language packages. 

VII. Real-World Problems 

A. Formulate the problem. 

B. Collect relevant data. 

C. Analyze the data. 

D. Apply the techniques using programming. 

VIII. Solutions 

A. Prepare solution documents. 

B. Justify solutions orally. 

C. Present visualizations.  

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 220

  • Title: Data Visualization
  • Number: DS 220
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Description:

This course introduces students to key design principles and techniques for interactively visualizing data. In addition to understanding how visual representations are used in the analysis and understanding of complex data, students will acquire data visualization skills, including designing effective visualizations and creating interactive visualizations using various tools.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe data visualization in the context of big data.
  2. Recognize graphical integrity.
  3. Describe and apply key design principles.
  4. Describe techniques of data visualization.
  5. Analyze various types of data sets.
  6. Apply visualizations and basic statistics to analyze and understand data.
  7. Analyze real-world problems based on data visualization techniques.
  8. Describe the different types of visualizations and when to use them.
  9. Identify what makes a visualization effective or ineffective.

Content Outline and Competencies:

I. Data Visualization and Big Data

A. Identify misleading graphs.

B. Illustrate stories with data.

C. Discuss the audience.

D. Discuss the story.

II. Graphical Integrity

A. Explain labeling.

B. Demonstrate sourcing.

C. Illustrate managing data relevance and density.

III. Key Design Principles

A. Express the importance of context.

B. Demonstrate exploratory and explanatory analysis.

IV. Data Visualization Techniques

A. Discuss choosing an effective visual.

1. Clutter

2. Decluttering

3. Audience focus

4. Accessibility

5. Aesthetics

6. Acceptance

B. Explain storytelling.

V. Data Sets

A. Use data from the web.

B. Select comma-separated value (CSV) files.

C. Analyze text files.

D. Manipulate relational tables.

VI. Visualization Application

A. Understand and use different types of visualizations to analyze and understand data.

B. Use basic statistical formulas to analyze and understand data.

VII. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply the techniques of data visualization.

VIII. Types of Visualizations and Uses

A. Determine type of analysis to be visualized.

1. Categorical

2. Hierarchical

3. Relational

4. Temporal

5. Spatial

B. Design and apply appropriate visualization.

IX. Effective and Ineffective Visualizations

A. Evaluate visualizations for effectiveness.

B. Propose improvements of visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 230

  • Title: SQL for Data Analysis
  • Number: DS 230
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Description:

In this course, students will focus on how to apply the Structured Query Language (SQL) to data analysis tasks. Spreadsheets will be used for the visualization of data. Additionally, basic statistics will be covered. All data will be extracted from relational tables.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe SQL in the context of data analysis.
  2. Use SQL queries.
  3. Apply SQL to data exploration.
  4. Use basic statistics.
  5. Apply SQL to time-related data analysis.
  6. Apply SQL for business analysis.
  7. Analyze real-world problems using SQL.
  8. Explain the solutions.

Content Outline and Competencies:

I. SQL and Data Analysis

A. Discuss relevancy.

B. Illustrate Select statement examples.

II. SQL Queries

A. Use the Select clause.

B. Employ the From clause.

C. Apply the Where clause.

D. Illustrate the Group by clause.

E. Apply the Having clause.

F. Interpret the Order by clause.

G. Use the Fetch and Offset clause.

III. Data Exploration

A. Describe data exploration.

B. Use spreadsheets for visualization.

IV. Statistics

A. Describe statistical concepts.

1. Quantitative data

2. Identifier data

3. Categorical data

4. Mean

5. Mode

6. Median

7. Outliers

8. Standard deviation

B. Analyze data using statistics.

V. Time-Related Data Analysis

A. Describe concepts.

1. Distributions

2. Plots

3. Correlation

4. Causation

5. Linear regression

6. Hypothesis testing

B. Analyze time-related data.

VI. Business Analysis

A. Describe various types of business analysis.

B. Apply different SQL analysis techniques to relational data.

1. Selection

2. Filters

3. Aggregates

4. Joins

5. Sorts

VII. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply analysis using SQL.

VIII. Solutions

A. Prepare solution documents.

B. Justify solutions orally.

C. Present visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 240

  • Title: Introduction to Statistical Programming
  • Number: DS 240
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Description:

Students in this course will use a statistical programming language to perform effective data analysis. Students will acquire programming skills including reading data, accessing statistical packages, writing functions, debugging, profiling code, organizing code and commenting code.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe the statistical programming language in the context of big data.
  2. Describe the statistical software environment.
  3. Apply basic skills of a statistical programming language.
  4. Use the statistical programming language to access data.
  5. Construct statistical programming language functions.
  6. Use the statistical programming language advanced data structures.
  7. Analyze real-world problems using the statistical programming language.
  8. Explain the solutions.

Content Outline and Competencies:

I. Statistical Programming Language and Big Data

A. Describe the statistical programming language.

B. Describe the benefits of a statistical programming language.

C. Apply the statistical programming language to big data.

II. Statistical Programming Language Software Environment

A. Use the statistical programming language integrated development environment (IDE).

B. Explain the statistical programming language packages.

III. Statistical Programming Language

A. Use variables.

B. Demonstrate vectors.

C. Construct control statements.

D. Write loops.

E. Create functions.

F. Interpret missing data.

IV. Data

A. Use comma separated value (CSV) files.

B. Employ spreadsheet data.

C. Select databases.

D. Manipulate data included with the statistical programming language.

E. Use data from websites.

V. Functions

A. Memorize syntax.

B. Use function arguments.

C. Use return values.

VI. Advanced Structures

A. Use data frames.

B. Employ lists.

C. Construct matrices.

D. Create arrays.

VII. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply the statistical programming language.

VIII. Solutions

A. Prepare solution documents.

B. Justify solutions orally.

C. Present visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 260

  • Title: Data Mining*
  • Number: DS 260
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Requirements:

Prerequisites: DS 210 and DS 240.

Description:

This course will provide students with an understanding of fundamental data mining methodologies and the ability to formulate and solve problems with these methodologies. Particular attention will be paid to the process of extracting data, analyzing it from many dimensions or perspectives, then producing a summary of the information in a useful form that identifies relationships within the data. The lectures will be complemented with hands-on experience with data mining software to allow development of execution skills.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe data mining in the context of big data. 
  2. Describe and apply a data mining methodology. 
  3. Identify inputs to the data mining process. 
  4. Produce outputs from data mining process. 
  5. Analyze real-world problems using data mining techniques. 
  6. Explain the solutions.

Content Outline and Competencies:

I. Data Mining and Big Data

A. Explain data mining and machine learning.

B. Explain patterns.

C. Discuss examples.

D. Discuss ethics.

II. Data Mining Methodology

A. Illustrate business understanding.

B. Apply data understanding.

C. Summarize data preparation.

D. Employ modeling.

E. Discuss evaluation.

G. Create data flow diagrams.

III. Data Mining Process Inputs

A. Determine sparse data.

B. Handle missing values.

C. Identify and treat outliers appropriately.

D. Explain data.

E. Transform data.

F. Extract data using application programming interfaces (APIs).

G. Extract data using web scraping.

IV. Data Mining Process Outputs

A. Produce a variety of supervised models.

B. Produce a variety of unsupervised models.

C. Employ time series analysis.

D. Create dashboards.

V. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply the techniques of data mining.

VI. Solutions

A. Prepare solution documents.

B. Justify solutions orally.

C. Present visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 270

  • Title: Introduction to Machine Learning*
  • Number: DS 270
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Requirements:

Prerequisites: DS 210 and DS 240.

Description:

This introductory course gives an overview of machine learning concepts, techniques and algorithms. Supervised and unsupervised machine learning will be covered. Machine learning is an integral part of data analytics, which deals with developing data-driven insights for better designs and decisions and gives computers the ability to learn without being explicitly programmed.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe machine learning in the context of big data.
  2. Discuss machine learning basics.
  3. Use structured and unstructured data.
  4. Describe supervised machine learning algorithms.
  5. Create supervised machine learning models.
  6. Describe unsupervised machine learning algorithms.
  7. Create unsupervised machine learning models.
  8. Analyze real-world problems based on machine learning techniques.
  9. Explain the solutions.

Content Outline and Competencies:

I. Machine Learning and Big Data

A. Discuss big data.

B. Discuss artificial intelligence.

II. Machine Learning Basics

A. Define key terminology.

B. Interpret key tasks.

C. Illustrate algorithms.

D. Demonstrate steps in developing an application.

E. Employ programming libraries.

III. Structured and Unstructured Data

A. Discuss supervised learning.

B. Discuss unsupervised learning.

IV. Supervised Machine Learning Algorithms

A. Explain k-nearest neighbors.

B. Explain decision trees and random forests.

C. Explain linear regression.

D. Explain multiple regression.

V. Supervised Machine Learning Algorithms Application

A. Create classification models.

B. Create regression models.

VI. Unsupervised Machine Learning Algorithms

A. Explain association rules.

B. Explain cluster analysis.

VII. Unsupervised Machine Learning Algorithms Application

A. Create association models.

B. Create cluster analysis models.

VIII. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply the techniques of machine learning.

IX. Solutions

A. Prepare solution documents.

B. Justify solutions orally.

C. Present visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).

DS 280

  • Title: Big Data Architecture
  • Number: DS 280
  • Effective Term: 2021-22
  • Credit Hours: 3
  • Contact Hours: 3
  • Lecture Hours: 3

Description:

This course covers emerging big data architectures that deal with large amounts of unstructured and semi-structured data. This course is designed for developers who need to create applications to analyze big data stored in distributed file systems. Topics include file architecture, data retrieval, performance and data analysis.

Supplies:

Refer to the instructor's course syllabus for details about any supplies that may be required.

Objectives

  1. Describe the importance of big data architecture.
  2. Describe big data architectures.
  3. Identify software technologies of big data architecture platforms.
  4. Describe data-related technologies of big data architecture platforms.
  5. Describe storage technologies of big data architecture platforms.
  6. Employ big data architecture platforms for data analysis tasks.
  7. Analyze real-world problems using big data architecture platforms and associated technologies.
  8. Explain the solution.

Content Outline and Competencies:

I. Big Data Architecture

A. Illustrate storing.

B. Demonstrate transforming.

C. Describe analyzing.

II. Architectures

A. Explain data migration.

B. Describe business uses.

III. Software Technologies

A. Discuss open-source software.

1. Data division

2. Data analysis

3. Multiple users

B. Illustrate algorithms.

IV. Data-Related Technologies

A. Practice data retrieval.

B. Practice data analysis.

V. Storage Technologies

A. Show file systems.

B. Demonstrate performance issues and solutions.

C. Practice storage and retrieval of non-relational data.

D. Practice storage and retrieval of relational data.

VI. Data Analysis Using Architecture Platform

A. Use platform technologies for data analysis.

B. Solve case studies.

VII. Real-World Problems

A. Formulate the problem.

B. Collect relevant data.

C. Analyze the data.

D. Apply the techniques of big data architecture platforms and core technologies.

VIII. Solutions

A. Prepare solution documents.

B. Justify solutions orally.

C. Present visualizations.

Method of Evaluation and Competencies:

35-50%    Exams and Quizzes
20-40%    Assignments and Labs
25-30%    Final Project and Presentation, ePortfolio Update

Total: 100%

Grade Criteria:

90 - 100% = A
80 - 89% = B
70 - 79% = C
60 - 69% = D
0 - 59% = F

Caveats:

Student Responsibilities:

Disabilities:

JCCC provides a range of services to allow persons with disabilities to participate in educational programs and activities. If you are a student with a disability and if you are in need of accommodations or services, it is your responsibility to contact Access Services and make a formal request. To schedule an appointment with an Access Advisor or for additional information, you may send an email or call Access Services at (913)469-3521. Access Services is located on the 2nd floor of the Student Center (SC 202).