Difference between revisions of "Troque Week 6"
(→Formatting product.txt file: Added solution to processing the product file) |
m (→Formatting the application.txt file: Starting to add some info on my thought process) |
||
Line 25: | Line 25: | ||
=== Formatting the application.txt file === | === Formatting the application.txt file === | ||
+ | Before starting this assignment, I didn't realized that I could easily remove the spaces using the regex for the tab character <code>\t</code>, and so I tried to take out the unnecessary blank spaces using the regex <code>\s*</code>. The result of using this method is the following (which didn't really work out since I got frustrated from all the outliers in this txt file and I eventually trashed this set of commands): | ||
+ | |||
+ | |||
+ | |||
cat application.txt | | cat application.txt | | ||
sed "s/\t/,/g" | | sed "s/\t/,/g" | | ||
Line 47: | Line 51: | ||
OrphanCode) values (/g" | | OrphanCode) values (/g" | | ||
sed "1d" > ~/public_html/application.sql.txt | sed "1d" > ~/public_html/application.sql.txt | ||
− | |||
=== Formatting product.txt file === | === Formatting product.txt file === |
Revision as of 00:27, 15 October 2015
Contents
Creating the Table on PostgreSQL
First, login to the LMU CS server using ssh. Type in the following in a command prompt (Windows) or terminal (Mac) window:
ssh <username@my.cs.lmu.edu>
Enter your password. Note: You will not visibly see the cursor move when typing in your password so just keep typing. You will end up on your home directory.
Here, you can use the command ls
in order to see the list of files in the directory. Then we can start manipulating some files.
On the PostgreSQL command window, type the following to create a table called "application":
create table application (ApplNo int primary key, ApplType varchar, SponsorApplicant varchar, MostRecentLabelFlag boolean, CurrentPatentFlag boolean, ActionType varchar, ChemicalType int, Ther_Potential varchar, OrphanCode varchar);
To create the table for the product data, enter the following command on the PostgreSQL window:
create table product (ApplNo int references application, ProductNo int, Form varchar, Dosage varchar, ProductMktStatus int, TECode varchar, ReferenceDrug int, drugname varchar, activeingred varchar)
Processing the Data on the Command Line
Formatting the application.txt file
Before starting this assignment, I didn't realized that I could easily remove the spaces using the regex for the tab character \t
, and so I tried to take out the unnecessary blank spaces using the regex \s*
. The result of using this method is the following (which didn't really work out since I got frustrated from all the outliers in this txt file and I eventually trashed this set of commands):
cat application.txt | sed "s/\t/,/g" | sed "s/,/,'/1" | sed "s/,'[A-Z]/&'/1" | sed "s/,/,'/2" | sed "s/\s*,False/',False/1" | sed "s/,/,'/5 " | sed "s/,/',/6" | sed "s/,,/,null,/" | sed "s/,,/,null,/" | sed "s/,\r$/,null/g" | sed "s/,S,/,'S',/g" | sed "s/,P,/,'P',/g" | sed "s/,V/,'V'/g" | sed "s/P\*/'P\*'/g" | sed "s/S\*/'S\*'/g" | sed "s/,AP,/,'AP',/g" | sed "s/$/);/g" | sed "s/^/insert into application (ApplNo,ApplType,SponsorApplicant,MostRecentLabelFlag, CurrentPatentFlag,ActionType,ChemicalType,Ther_Potential, OrphanCode) values (/g" | sed "1d" > ~/public_html/application.sql.txt
Formatting product.txt file
cat Product.txt | sed "1d" | sed "s/,/;/g" | sed "s/\t/,/g" | sed "s/,,/,null,/g" | sed "s/'//g" | sed "s/,/,'/2" | sed "s/,/','/3" | sed "s/,/',/4" | sed "s/,/,'/5" | sed "s/,/',/6" | sed "s/,/,'/7" | sed "s/,/','/8" | sed "s/\r$/');/g" | sed "s/'null'/null/" | sed "s/^/insert into product (ApplNo,ProductNo,Form,Dosage,ProductMktStatus,TECode, ReferenceDrug,drugname,activeingred) values (/g" > ~/public_html/Product.sql.txt
Assignment Links
Weekly Assignments
- Week 1
- Week 2
- Week 3
- Week 4
- Week 5
- Week 6
- Week 7
- Week 8
- Week 9
- Week 10
- Week 11
- Week 12
- No Week 13 Assignment
- Week 14
- Week 15
Individual Journal Entries
- Week 1 - This is technically the user page.
- Week 2
- Week 3
- Week 4
- Week 5
- Week 6
- Week 7
- Week 8
- Week 9
- Week 10
- Week 11
- Week 12
- No Week 13 Assignment
- Week 14
- Week 15