GitHub - bautisd/getdata: project

title	GetData CP1
author	Dave Bautis
date	09/19/2014
output	html_document

#Course Project

Get Data

script was tested on linuxmint17 and R 3.0.2

Aquire the data

Script will download a zip file from https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip Script will unzip the file to a tmp directory- tempdir() You man need to modify this part if you are using a different OS

Features

extract the feature list (columns of test/train data) from ./UCI HAR Dataset/features.txt This will be used to set the column names in the test/train data sets

Test data

Extract the test data from ./UCI HAR Dataset/test/X_test.txt Extract the activity column from ./UCI HAR Dataset/test/y_test.txt extract the subjects column from the ./UCI HAR Dataset/test/subjects.txt

Add these columns to the test dataframe

Train data

Extract the test data from ./UCI HAR Dataset/train/X_test.txt Extract the activity column from ./UCI HAR Dataset/train/y_test.txt extract the subjects column from the ./UCI HAR Dataset/train/subjects.txt

Add these columns to the train dataframe

Merge the test_data and train_data

using rbind

Map Activity column to descriptive names

Activity column is currently a numeric there are descriptive names these are extracted : ./UCI HAR Dataset/activity_labels.txt I used a loop instead of hard coding in the event that there was an update to the descriptions in future version of the source data

Extract the mean and std columns

I used grep for mean and std in the column names to extract the numerical indexes I subsetted the merged data set with these column names and subject, activity Then aggregated by the subject and activity columns with MEAN as the function

Output

is written to working directory as output.txt

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
codebook.md		codebook.md
output.txt		output.txt
run_analysis.R		run_analysis.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Get Data

Aquire the data

Features

Test data

Train data

Merge the test_data and train_data

Map Activity column to descriptive names

Extract the mean and std columns

Output

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Get Data

Aquire the data

Features

Test data

Train data

Merge the test_data and train_data

Map Activity column to descriptive names

Extract the mean and std columns

Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages