Please support our Software Development advertiser: Programming Forums
Jun 23rd, 2008, 9:01 pm
You’re about to begin a project that will tap into or integrate data from a database. You’ve been looking for low-cost ways to clear that data of duplicates, near-dupes, and obsolete or garbage data. But cleansing tools are expensive.
As of today, there’s a free solution for you to think about. Integration tools maker Talend today announced general availability of Open Profiler, a GUI-based tool for Linux, Unix and Windows that lets developers peek inside data sources to evaluate the quality of the data they’re about to work with to verify it adheres to project goals or metrics.
Open Profiler 1.0.0RC1 includes a metadata repository, which stores results of its introspections of files and data stores. The metadata can then be used by developers and data analysts to create metrics and indicators. These indicators are statistics such as groups of data with certain numbers of rows, null values, distinct or unique values, and duplicates or blank fields. Other indicators include minimum, maximum and average length of text in fields; computation of numerical summary values such as for mean, average, inner quartile and range definitions; and advanced statistic such as mode and frequency tables. The tool also can render the statistics as tables and graphs.
“Companies in every business face significant losses and inefficiencies that are caused by poor data quality,” said Talend CEO Bertrand Diard. Open Profiler, he continued, “helps companies understand and regain control of the quality of their data.”
Open Profiler 1.0.0RC1 is available now under the GPL 2 open source license.
As of today, there’s a free solution for you to think about. Integration tools maker Talend today announced general availability of Open Profiler, a GUI-based tool for Linux, Unix and Windows that lets developers peek inside data sources to evaluate the quality of the data they’re about to work with to verify it adheres to project goals or metrics.
Open Profiler 1.0.0RC1 includes a metadata repository, which stores results of its introspections of files and data stores. The metadata can then be used by developers and data analysts to create metrics and indicators. These indicators are statistics such as groups of data with certain numbers of rows, null values, distinct or unique values, and duplicates or blank fields. Other indicators include minimum, maximum and average length of text in fields; computation of numerical summary values such as for mean, average, inner quartile and range definitions; and advanced statistic such as mode and frequency tables. The tool also can render the statistics as tables and graphs.
“Companies in every business face significant losses and inefficiencies that are caused by poor data quality,” said Talend CEO Bertrand Diard. Open Profiler, he continued, “helps companies understand and regain control of the quality of their data.”
Open Profiler 1.0.0RC1 is available now under the GPL 2 open source license.
This blog entry was written by Edward J Correia, staff writer aka EddieC. It has received 592 views, 0 comments, and 6 linkbacks. It was promoted to featured status Jun 23rd, 2008.
•
•
•
•
apple ballmer bill gates browsers business computer dell desktop development fedora firefox games google gpl hardware hp ibm internet internet explorer ipod linux merger microsoft mobile mozilla news novell office open open source operating operating systems os red hat search security server software source system ubuntu unix upgrade virtualization vista vmware web windows xp yahoo
All Recent Tags Post Comment
•
•
•
•
Only community members can start a blog or comment on blog entries. You must register or log in to contribute.
•
•
•
•
•
•
•
•
DaniWeb Software Development Marketplace
Related Blog Entries
- Thunder Tables Kill Microsoft 40-bit Encryption (2 Days Ago)
- CMG: Free Performance Data and White Papers (4 Days Ago)
- Ballmer To Apple: Divorce Hardware and Software (8 Days Ago)
- Google Phone Feeding Frenzy (10 Days Ago)
- The six million dollar World of Warcraft bot (12 Days Ago)
- Flash May Soon Brighten the iPhone (13 Days Ago)
- Q and A with Electric Cloud CEO Mike Maciag (15 Days Ago)
- Unlocked iPhone 3Gs Now at Apple Store (15 Days Ago)
- Apple Updates Its Java VM (16 Days Ago)
- Why did Apple take 5 months to fix 24 security holes in OS X Java? (17 Days Ago)
Featured Entry