What about loading all the data from the .CSV into the database using the loader and filtering out the unwanted records later?
Well sir, you are right!
The CSV file that I loaded had 77 records, of which only 59 were added to the Oracle table!
Some columns contain blank spaces, which are NULL by default, and some columns contain entries like 'fifty' or 'zero' instead of 50 or 0. Those replacements have to be done!
This is the discard file:
1,Nord-Ost Matjeshering,1,1,10 boxes x 20 bags,18,39,0,10,
3,Mascarpone Fabioli, ,2,12 - 550 ml bottles,10,13,70,25,FALSE
5,Sasquatch Ale,2,2,36 boxes,21.35, NA,0,0,TRUE
8,Gravad lax,3,2,12 - 12 oz jars,40,6, ,0,FALSE
12,Jack's New England Clam Chowder,5,4,10 - 500 g pkgs., ,86,0,0,FALSE
14,Ipoh Coffee, ,7,40 - 100 g pkgs.,23.25,35,0,0,FALSE
16,Rogede sild,7,3,32 - 500 g boxes,17.45, fifty,0,10,FALSE
17,Alice Mutton,7,6,20 - 1 kg tins,39,0,0,0,
19,Teatime Chocolate Biscuits,8,3,10 boxes x 12 pieces,9.2,25, ,5,FALSE
20,Sir Rodney's Marmalade, ,3,30 gift boxes,81,40,0,0,FALSE
22,Grape sause,9,5,24 - 500 g pkgs.,21,104, ,25,
29,Thüringer Rostbratwurst,12,6,50 bags x 30 sausgs.,123.79,0,0,0,TRUE
34,Chef Anton's Gumbo Mix,16,1,24 - 12 oz bottles,14,111, zero,15,FALSE
38,Mishi Kobe Niku,18,1,12 - 75 cl bottles,263.5,17,0,15,FALSE
52,Filo Mix,24,5,16 - 2 kg boxes, seven,38,0,25,FALSE
71,Flotemysost,15,4,10 - 500 g pkgs.,21.5,26, zero,0,FALSE
74,Longlife Tofu,4,7, twenty,10,4,20,5,
77,Original Frankfurter ,12,2,12,13, ,0,15,FALSE
I want to include entries 43, 71 and 74, but how can I convert the words into numbers?
Should I use Java to automate the data cleaning?
I cannot find any help for that!
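For what it's worth, here is a rough sketch of how the word-to-number replacement could look in Java before the file goes back through the loader. Everything in it is an assumption on my part: the class name NumericFieldCleaner, the methods clean and cleanLine, the choice of which column positions count as numeric, and the idea that blank and NA fields should simply become empty. It also assumes no field contains a quoted comma, which holds for the discard file above.

```java
import java.util.Locale;
import java.util.Map;

// Hypothetical helper; class and method names are illustrative only.
final class NumericFieldCleaner {

    // Only the number words seen in the discard file are listed;
    // extend the map if other words turn up.
    private static final Map<String, String> WORDS = Map.of(
            "zero", "0",
            "seven", "7",
            "twenty", "20",
            "fifty", "50");

    // Returns digits for a known number word, an empty string for a
    // blank or "NA" field, and the trimmed input unchanged otherwise.
    static String clean(String raw) {
        String value = raw == null ? "" : raw.trim();
        if (value.isEmpty() || value.equalsIgnoreCase("NA")) {
            return "";
        }
        String digits = WORDS.get(value.toLowerCase(Locale.ROOT));
        return digits != null ? digits : value;
    }

    // Cleans only the given zero-based column positions of one CSV line.
    // Assumption: fields never contain quoted commas.
    static String cleanLine(String line, int... numericColumns) {
        String[] fields = line.split(",", -1); // -1 keeps trailing empty fields
        for (int col : numericColumns) {
            if (col < fields.length) {
                fields[col] = clean(fields[col]);
            }
        }
        return String.join(",", fields);
    }
}
```

Calling NumericFieldCleaner.cleanLine on record 16 with columns 5 to 8 would turn " fifty" into 50 and leave the other fields alone. A field that becomes empty should then load as NULL, as far as I know, since Oracle treats an empty string as NULL. Words that are not in the map are passed through unchanged, so such records would still be rejected by the loader rather than silently altered.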
Attached: Oracle table with valid values.