Data Mining and Knowledge Discovery - Group Project

Description:
--------------
This is a Group Project. Locate your Group Members on Canvas.*

Part1 ::: Organisation of Group Work ---

1.1. One student assumes the ROLE of Project LEADer. Any student who feels comfortable LEADing can assume this ROLE and do the following :::

1.2. Set up a Doodle Poll ( https://doodle.com/create ) for Group Members to enter their available / best times for Group Meetings. SET UP the Doodle Poll as follows :::
     Title : “Group Meeting” - Continue
     DAYS : every day - Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday
     TIMES : create 2-hour blocks ( for example 9:00am-11:00am, 10:00am-12:00pm ), starting at 9:00am and ending at 10:00pm, on each day
     Complete it for 7 days ( 1 full week ). Use the upcoming week, starting from Monday.
     See EXAMPLE Doodle Poll here : https://doodle.com/poll/e7cp3yehpradfuqd?utm_source=poll&utm_medium=link

1.3. Decide how to divide the subject material among the Group Members. Divide the material so that each student has 5 PowerPoint slides to create.

1.4. E-mail ALL Group Members ( and Cc: the Teaching Assistants ) :::
     “Dear Group # ,
     My name is : … , and I assume the Project LEADer Role for our course # .
     Please fill out your availability for Group Meetings at Doodle Poll --->>> give link to Doodle Poll here <<<---
     Meetings are held online via Zoom - WEEKLY ( every week ) - for 2 hours. Group Members are required to RESPOND and Attend / JOIN the Meetings.
     For your Individual Contribution, you are assigned to create 5 PowerPoint slides on the following Subject :
     GroupMemberName | Subject
     GroupMemberName | Subject
     GroupMemberName | Subject
     ... list All Members
     Thank you,”

1.5. The Project LEADer decides the best time for the Group Meeting by choosing, from the Doodle Poll, the time during which the MAJORITY of Group Members are available.
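The selection rule in 1.5 can be sketched in Java as follows. This is only an illustration, not part of the assignment: the class name MeetingSlotPicker, the method pickSlot, and the member names are hypothetical, and the poll responses are typed in by hand rather than read from Doodle. It also assumes that "majority" means more than half of the group and that ties go to the slot listed first.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper: picks the meeting slot with the most available members.
class MeetingSlotPicker {

    /**
     * Returns the slot with the most available members, or null when no slot
     * reaches a strict majority (more than half) of the group.
     * Ties go to the slot that appears first in the map.
     */
    public static String pickSlot(Map<String, List<String>> availability, int groupSize) {
        String best = null;
        int bestCount = 0;
        for (Map.Entry<String, List<String>> entry : availability.entrySet()) {
            int count = entry.getValue().size();
            if (count > bestCount) {
                best = entry.getKey();
                bestCount = count;
            }
        }
        // Assumption: "majority" is read as more than half of the group.
        return (bestCount * 2 > groupSize) ? best : null;
    }

    public static void main(String[] args) {
        // Hand-entered sample responses; slot label -> members who ticked it.
        Map<String, List<String>> poll = new LinkedHashMap<>();
        poll.put("Mon 9:00am-11:00am", List.of("Ana", "Ben"));
        poll.put("Wed 6:00pm-8:00pm", List.of("Ana", "Ben", "Chen", "Dee"));
        poll.put("Sat 10:00am-12:00pm", List.of("Ben", "Chen", "Dee"));

        System.out.println(pickSlot(poll, 5)); // prints "Wed 6:00pm-8:00pm"
    }
}
```

A null result means no slot has a majority yet, which is a cue to ask Group Members to fill out more time slots.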
     All Group Members are REQUIRED to JOIN and attend a WEEKLY Meeting for 2 hours. Please be flexible and FILL OUT as many time slots as possible in the Doodle Poll, to help find the best time for everyone to meet.

1.6. The Project LEADer e-mails a Meeting Reminder every WEEK, 3 days before the Meeting time. For example, if the Meeting is on Wednesday at 6:00pm-8:00pm, then the Project LEADer e-mails ALL Group Members on Monday by 6:00pm the following :::
     “Dear Group # ,
     REMINDER - we have a Group Meeting on --->>> give Day and Time here <<<--- ( example : Wednesday at 6:00pm-8:00pm ), via Zoom
     Link :
     password :
     ... list All Members
     Thank you,”

Part2 ::: Individual Contribution Submission ---

2.1. Each student creates 5 PowerPoint slides on the assigned subject ( as shown below ). The Project LEADer decides how to divide the subject material among the Group Members. Each student submits individually on Canvas : the 5 PowerPoint slides and a VIDEO.

2.2. Record a .mp4 video explaining your 5 PowerPoint slides. The target length of the video is 5 minutes ( maximum 10 minutes ). The video should have Animations, that is : include moving objects in the video, or draw lines and circles, or use the mouse pointer. The video should contain sound : record your voice reading the PowerPoint text and explaining the concepts.
     Example good videos with Animations :
     _2.1. https://www.youtube.com/watch?v=ZMBTLuVJtLM - What is the world wide web Twila Camp
     _2.2. https://www.youtube.com/watch?v=sb7ywQDxgFs - Chapter2 1 2 4 01PhysicalLayer 02
     Example poor video ( not acceptable ) :
     _2.3. https://www.youtube.com/watch?v=A-uDY29YPkU - Ch5.3_5.5_06PacketScheduling_02

2.3. Proper NAMING of submission files
     _*. NAME your PowerPoint file as ::: Group#_SubjectOfPowerPoint.ppt
         for example ::: Group03_TelephoneSystem_GSM.ppt
     _*. NAME your VIDEO file as ::: Group#_SubjectOfVideo.mp4
         for example ::: Group03_TelephoneSystem_GSM.mp4

2.4.
*Note: A student who works on writing the Code and records the Code DEMO Video is exempt from creating PowerPoint slides.

Part3 ::: Group Submission Instructions ---

3.1. Total number of PowerPoint slides for the entire group = number of Group Members * 5. For example, if there are 9 members in the group, then the total number of slides in the PowerPoint = 9 * 5 = 45. The Project LEADer submits one file on Canvas for the entire group.

3.2. Implement your assigned algorithm ( as shown below ). Use JAVA as the programming language. Create a User Interface.

3.3. One student runs a Demonstration of the code before the class, and explains the purpose of the code, what inputs it takes, and what outputs it produces.

3.4. Code DEMO Video Recording Instructions ::: Record a .mp4 Video demonstrating the Code :
     _*1. One student records the DEMO of the Code and explains HOW to RUN the Code.
     _*2. Give the Command for running the Code.
     _*3. Specify any Parameters required for running the Code.
     _*4. Give the location of the input Data file ( if any ).
     _*5. Explain the LOGIC of the Code.
     _*6. Show the Output files and the Location of the Output files.
     _*7. OPEN the Output files and explain the MEANING of the Output.
     _*8. Create a README.txt file documenting steps _*1 - _*7 above, and submit the README.txt file along with your Code on Canvas.

3.5. Submit the project files ( PowerPoint file, VideoLink, and SourceCode ) to Canvas. Due : 3 days prior to your assigned Group Moderator Date ( the presentation date shown on the syllabus ).

3.6. Present the PowerPoints, Video, and Implementation Demo to the class on your assigned Group Moderator Date ( shown on the syllabus ).

3.7. Each student presents 5 PowerPoint slides, speaks for 3 to 5 minutes maximum, and prepares 1 question for the audience based on his / her PowerPoint slides.

3.8. Answer questions. Each audience group asks the Presenters 1 question.

3.9. Bring CANDY / Sweets ( ex. chocolates ( Snickers, Mars, M&M's, etc. )
, cookies, cupcakes, doughnuts ) for the audience. Each audience group gives the presenters a score from 0 to 10.

Part4 ::: RATE my Group Members ---

4.1. Go to the GoogleForm link ::: https://docs.google.com/forms/d/1Uvixq7AL1g-m2zGAu5-jma0GonkWjHcSvTbwX8wLD9o and complete 1 Form for each Group Member. Write comments about each Group Member's contribution to the Project Work.

Group 1
Presentation Subject: Association Rule Mining - Market Basket Analysis
Chapter 6.2 - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in Exercise 2 a, b Chapter 6 - Apriori Algorithm
http://webpages.uncc.edu/aatzache/ITCS6162/Exercises/Ch6Ex2a,b.docx

Group 2
Presentation Subject: Attribute Selection For Decision Tree
Chapter 4.2 - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in Exercise 2 Chapter 4 - Decision Tree
http://webpages.uncc.edu/aatzache/ITCS6162/Exercises/Ch4Ex2.docx

Group 3
Presentation Subject: Frequent Pattern Growth Strategy (FP-Tree)
Chapter 6.8 (Table 6.24) - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in Exercise 8 Chapter 6 (Table 6.24) - Frequent Pattern Growth strategy
http://webpages.uncc.edu/aatzache/ITCS6162/Exercises/Build_FP-Tree_using_the_data_below.txt

Group 4
Presentation Subject: Decision rules - LERS (certain and possible rules)
Chapter 6.8 - "Introduction to Data Mining (2nd Edition)"
implement: Steps described in the document for reference
http://webpages.uncc.edu/aatzache/ITCS6162/PowerPoints/LERS.doc
download LERS software - calculate rules and classify data
save your .rses project file ( File | Save As in RSES ) and upload the .rses file to Canvas
http://webpages.uncc.edu/aatzache/ITCS6162/Exercises/LERS_Code.zip

Group 5
Presentation Subject: WEKA ORANGE - download WEKA software and ORANGE software - run clustering, association rules discovery, and a decision tree ( use one of the datasets, of your choice, which are pre-loaded in RSES )
Chapter 6 - "Introduction to
Data Mining (2nd Edition)"
implement: Steps described in the given file
http://webpages.uncc.edu/aatzache/DecisionTree_OnlineModule_Tzacheva/Exercise_02_PracticalApplication_BuildDecisionTree_OrangeDataMiningSoftware.pdf

Group 6
Presentation Subject: Action Rules
Chapter 6 - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in Action Rules Discovery Example
http://webpages.uncc.edu/aatzache/ITCS6162/PowerPoints/ActionRuleDiscoveryExample.doc

Group 7
Presentation Subject: Similarity Measures using Vectors
Chapter 6 - "Introduction to Data Mining (2nd Edition)"
implement: Steps described in the file
https://webpages.uncc.edu/aatzache/ITCS6162/Exercises/Exercise19Chapter2_ExtraCredit.txt

Group 8
Presentation Subject: Use RSES Software | Data Discretization
Chapter 6 - "Introduction to Data Mining (2nd Edition)"
implement: Follow the example in the given link
http://webpages.uncc.edu/aatzache/ITCS6162/PowerPoints/DiscretizationExample_RSES.doc
using RSES software | open a dataset | discretize the dataset
save your .rses project file ( File | Save As in RSES ) and upload the .rses file to Canvas

Group 9
Presentation Subject: Hierarchical Clustering
Chapter 8.3 - 8.6 - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in Exercise 16 Chapter 8 - Hierarchical Clustering - Single Link
http://webpages.uncc.edu/aatzache/ITCS6162/Exercises/Exercise16.Chapter%208.docx

Group 10
Presentation Subject: Partitioning Clustering - K-Means Example
Chapter 8 - "Introduction to Data Mining (2nd Edition)"
implement: Algorithm described in K-Means Clustering – Example
http://webpages.uncc.edu/aatzache/ITCS6162/PowerPoints/KMeansExample.doc

* Note: Project Participation
This is a Group Project. On Canvas, locate your Group Members and obtain their e-mails. This project requires that every student checks his / her UNCC e-mail account and communicates with his / her group-mates. Contact your group-mates as soon as possible.
Be sure to talk to them, meet with them, e-mail, telephone, use Facebook, or use any other means of communication you like. If a student is reported by his / her group-mates as non-responsive or not participating in the group activities, the student will receive a grade of 0 for this project. If a student is not present ( misses the class ) on the assigned presentation date, the student will receive a grade of 0 for this project.
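
As a quick reference for the file NAMING rule in 2.3 and the slide count in 3.1, here is a small Java sketch. It is an illustration only: the class SubmissionHelper and its methods are hypothetical names. It assumes that the group number is zero-padded to two digits (as in the Group03 example) and that spaces in a subject are replaced by underscores.

```java
import java.util.Locale;

// Hypothetical helper for the conventions in 2.3 (file names) and 3.1 (slide count).
class SubmissionHelper {

    // Each student creates 5 PowerPoint slides (2.1, 3.1).
    static final int SLIDES_PER_MEMBER = 5;

    /** Total slides in the group PowerPoint: number of Group Members * 5 (3.1). */
    public static int totalSlides(int groupMembers) {
        if (groupMembers < 0) {
            throw new IllegalArgumentException("groupMembers must not be negative");
        }
        return groupMembers * SLIDES_PER_MEMBER;
    }

    /**
     * Builds a name of the form Group#_Subject.extension (2.3).
     * Assumption: two-digit group number; whitespace in the subject becomes "_".
     */
    public static String fileName(int groupNumber, String subject, String extension) {
        String cleaned = subject.trim().replaceAll("\\s+", "_");
        return String.format(Locale.ROOT, "Group%02d_%s.%s", groupNumber, cleaned, extension);
    }

    public static void main(String[] args) {
        System.out.println(totalSlides(9));                            // prints 45
        System.out.println(fileName(3, "TelephoneSystem GSM", "ppt")); // prints Group03_TelephoneSystem_GSM.ppt
        System.out.println(fileName(3, "TelephoneSystem GSM", "mp4")); // prints Group03_TelephoneSystem_GSM.mp4
    }
}
```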