Introduction R programming language is big – maybe even “HUGE” to some IT folks. The statistical computing capabilities make R a popular tool in the Big Data ecosystem. There are many good resources that provide a good introduction to R. I’ll keep this post to minimal information to get an R programming environment set up... Continue Reading →
Introduction I thought I would put together a quick post with some Oracle stored procedure examples. I wanted some results to compare when writing similar queries/programs in other languages. I plan to use the same NFL games data set with MapReduce, R, SparkR and other programs. Prerequisites Need to set up Oracle tables and load... Continue Reading →
Introduction I missed this meeting due to weather issues in the area. Fortunately, the meeting was recorded and posted online. Doug Cutting, Chief Architecture at Cloudera was the guest speaker. https://stthomas.ensemblevideo.com/Watch/Ai43Lak6 This wasn’t a prepared speech, so there was some ah’s and um’s. The content was good overview of the Hadoop ecosystem – some of... Continue Reading →
Oracle SQL Loader (sqlldr.exe) utility provides an efficient way to perform a bulk data load into an Oracle table. Sqlldr.exe doesn’t record inserts into a transaction log, so performance is improved. The following example was set up to walk through a scenario to load data from a text file into an Oracle database. The steps... Continue Reading →
Oracle XE provides a developer relational database that is great for testing new code without requiring a server installation.
Data Science Examples Blog Introduction