Introduction Cloudera is a leading data management company. Cloudera offers a range of products built on Apache Hadoop, along with consulting services. For training purposes, Cloudera provides a QuickStart that lets people quickly set up an environment and start writing code. Downloads Cloudera QuickStarts for CDH NOTE: 4.8 GB Download – so plan accordingly! ... Continue Reading →
Introduction Oracle Cloud Day 2017 (Minneapolis) was held on January 24th. Great seminar sponsored by Oracle and a couple of their partners. Slides for the presentation can be found at http://oracle.cloudday.virtualresourcecenter.com Below are a few notes I took during a particular session. Oracle Keynote: Strategy and Vision for the Journey to Cloud John Dolan –... Continue Reading →
Introduction The R programming language is big – maybe even “HUGE” to some IT folks. Its statistical computing capabilities make R a popular tool in the Big Data ecosystem. Many good resources already provide an introduction to R, so I’ll keep this post to the minimum needed to get an R programming environment set up... Continue Reading →
Introduction I thought I would put together a quick post with some Oracle stored procedure examples. I wanted some results to compare when writing similar queries/programs in other languages. I plan to use the same NFL games data set with MapReduce, R, SparkR and other programs. Prerequisites Need to set up Oracle tables and load... Continue Reading →
Introduction I missed this meeting due to weather issues in the area. Fortunately, the meeting was recorded and posted online. Doug Cutting, Chief Architect at Cloudera, was the guest speaker. https://stthomas.ensemblevideo.com/Watch/Ai43Lak6 This wasn’t a prepared speech, so there were some ahs and ums. The content was a good overview of the Hadoop ecosystem – some of... Continue Reading →
The Oracle SQL*Loader (sqlldr.exe) utility provides an efficient way to perform a bulk data load into an Oracle table. In direct path mode, sqlldr.exe writes formatted data blocks straight to the data files instead of running conventional INSERT statements, and it can skip redo logging when the load is marked UNRECOVERABLE, so performance is improved. The following example was set up to walk through a scenario to load data from a text file into an Oracle database. The steps... Continue Reading →
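As a rough illustration of what a text-file load with sqlldr involves, here is a minimal Python sketch that only builds the two pieces the utility needs: a control file and a command line. It does not run sqlldr itself. The table name `nfl_games`, the column names, the file names, and the connect string are all hypothetical placeholders, not taken from the full post.

```python
from typing import List


def build_control_file(data_file: str, table: str, columns: List[str]) -> str:
    """Return SQL*Loader control file text for a comma-delimited text file."""
    column_list = ",\n  ".join(columns)
    return (
        "LOAD DATA\n"
        f"INFILE '{data_file}'\n"
        "APPEND\n"  # add rows to the table without clearing existing rows
        f"INTO TABLE {table}\n"
        "FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'\n"
        "TRAILING NULLCOLS\n"  # treat missing trailing fields as NULL
        f"(\n  {column_list}\n)\n"
    )


def build_sqlldr_args(connect: str, control_path: str, log_path: str,
                      direct: bool = False) -> List[str]:
    """Return the sqlldr argument list; direct=True requests a direct path load."""
    args = [
        "sqlldr",
        f"userid={connect}",  # note: a password on the command line is visible to other users
        f"control={control_path}",
        f"log={log_path}",
    ]
    if direct:
        args.append("direct=true")
    return args


# Hypothetical table, columns, and file names, for illustration only.
control_text = build_control_file(
    "games.txt", "nfl_games", ["game_id", "home_team", "away_team"]
)
command = build_sqlldr_args("scott/tiger@XE", "games.ctl", "games.log", direct=True)

print(control_text)
print(" ".join(command))
```

Saving `control_text` as `games.ctl` and running the printed command on a machine with the Oracle client installed would be the next step. Whether the full walkthrough uses a conventional or direct path load is not stated in the excerpt, so the `direct` flag here is left optional.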
Oracle XE provides a developer relational database that is great for testing new code without requiring a server installation.
Data Science Examples Blog Introduction