Data Smart, Ch 4, Linear Programming

Executive Summary In Chapter 4 of the book Data Smart, by John Foreman, the author uses Excel’s linear programming tool, Solver, to solve an optimization problem, specifically, minimizing the raw materials cost for a commercial orange juice blend. Consistent with this series, here we use R to solve the same problem, specifically, we invoke R’s lpSolve package. …

Continue reading ‘Data Smart, Ch 4, Linear Programming’ »

Predixion Delivers At ‘The Last Mile Of Analytics’

Agenda: ‘From Data Science to Business Impact’, with Jamie MacLennan, Co-founder and CTO of Predixion, at the Data Science Dojo meetup, Redmond, WA, Feb. 3, 2015.   In Brief Prediction has developed an impressive, cloud-based and user friendly predictive analytics framework which can be used on the data organizations have today. The session consisted of: an update on their distinctive and …

Continue reading ‘Predixion Delivers At ‘The Last Mile Of Analytics’’ »

Hadoop Essentials – The Eight Things You Need To Know

Agenda: Hadoop Essentials Live, by Cloudera, Seattle, December 18, 2014.   In brief: Cloudera prepared and presented a comprehensive slidedeck (200 slides) describing Hadoop and its ecosystem. Here are eight lessons from the workshop that are worth learning and remembering. 1. What is Hadoop?  Hadoop is a software framework for storing, processing and analyzing large volumes of data. Its key features are that it …

Continue reading ‘Hadoop Essentials – The Eight Things You Need To Know’ »

Analytics at Ebay – 90% Data Preparation + 10% Insight + 100% Follow-Through Action

Agenda: ‘R and Analytics at Ebay’ at Ebay, Bellevue, Jan 29, 2015, part of the always relevant Data Science Dojo Meetup series in the Seattle area. In Brief Tonight’s event was particularly interesting because, in addition to two strong presentations on data analysis and data science practices at Ebay, it included a personal statement by a 20-year …

Continue reading ‘Analytics at Ebay – 90% Data Preparation + 10% Insight + 100% Follow-Through Action’ »

Zillow opens the kimono – reveals R, Python and Graphlab Create underneath

Meetup: ‘Data Science at Zillow – the Zestimate and Beyond‘, at the Python Data Science Meetup, Seattle, Jan 27th, 2015. Slidedeck: http://slidesha.re/1ALRbvU   In brief Zillow described their 20TB dataset and the technology they use to estimate house values for more than 110 million homes in the US. Zillow uses the statistical programming language, R, for both prototyping and production. The use of Python in Zillow is …

Continue reading ‘Zillow opens the kimono – reveals R, Python and Graphlab Create underneath’ »

Anyone for Amazon Prime? They’re hiring.

The Seattle Technical Forum meetup hosted an Amazon sponsored event on Jan 21 at Bellevue City Hall. The motivation for sponsoring this event is that Amazon Prime is recruiting. Members of the Amazon Prime team spoke about what their product is and what the team does. What I especially noted was the comment that Data Engineers at Amazon must …

Continue reading ‘Anyone for Amazon Prime? They’re hiring.’ »

AWS Talks Cloud Security, Seattle, Oct. 15, 2014

This event was organized by the AWS Seattle Official Events meetup. The speakers were Max Ramsay, Sr Manager for Security Solutions Architecture, AWS and Mark Nunnikhoven, VP, Cloud and Emerging Technologies, Trend Micro. Full agenda here.   Fig. 1 – Shared Security Model AWS Subject To Continuous Security Audits By Customers Gartner positioned AWS as the runaway leader in …

Continue reading ‘AWS Talks Cloud Security, Seattle, Oct. 15, 2014’ »

Inome enjoys ‘Infinite Truth’ and can increase your accessible market. Big Data Bellevue Meetup, Oct. 15, 2014

  Background Inome is in the business of building information on individuals from public data records and other sources. It sells this information to marketing companies. Inome (aka Intelius), has accumulated $1 billion in sales in its ten years of existence. It maintains records on about 300 million people and a graph of about 16 billion edges. …

Continue reading ‘Inome enjoys ‘Infinite Truth’ and can increase your accessible market. Big Data Bellevue Meetup, Oct. 15, 2014’ »

Indeed.com talks data (125 TB of it) in Seattle, Oct. 7, 2014.

This evening’s talk on ‘Data Driven Decision Making at Indeed’ was organized by the Seattle chapter of The Data Warehousing Institute and presented by Chris Hyams, SVP, Product and International, Indeed.com, and co-authored by Douglas Gray, Sr. VP, Engineering. Location: Group Health office, 320, Westlake Ave. N., Seattle. Fig 1: Indeed is responsible for 50% of hires originating from …

Continue reading ‘Indeed.com talks data (125 TB of it) in Seattle, Oct. 7, 2014.’ »

5 FACTS! Doug Cutting, co-creator of Hadoop, speaks at Big Data Seattle Meetup

This evening’s talk was held under the aegis of two meetup groups, Big Data Seattle and Pacific Northwest Cloudera User Group. The location was Disney’s offices at 925 4th Ave., Seattle. Disney has been using Hadoop for about three years to help optimize their ad placements on Disney web properties such as abc.com and espn.com.   Introduction Doug …

Continue reading ‘5 FACTS! Doug Cutting, co-creator of Hadoop, speaks at Big Data Seattle Meetup’ »