Lost? Have questions? Visit the onsite Hospitality Desk located in the Festival Pavilion.

Tuesday, June 11
 

7:00am

Open Registration
Tuesday June 11, 2013 7:00am - 9:00am
Festival Pavilion Fort Mason, San Francisco, CA

7:30am

Breakfast
Tuesday June 11, 2013 7:30am - 8:30am
Fort Mason Fort Mason, San Francisco, CA

7:30am

Press Relations Breakfast - Invite Only
Tuesday June 11, 2013 7:30am - 8:30am
Festival Pavilion Fort Mason, San Francisco, CA

8:45am

Keynote Sessions
Tuesday June 11, 2013 8:45am - 9:30am
Festival Pavilion Fort Mason, San Francisco, CA

9:30am

What were they thinking?

Nothing is more frustrating than knowing you may have the right solution to a problem, only to have the rug pulled out from under you and your project derailed for unknown reasons or because your solution was not clearly understood. While you may not see your job as being a business champion, the fact is that you can be a powerful force in stopping your company from making the wrong choice.

Understanding how others in the organization see Cassandra and similar technologies can help bridge the gap between engineering and management. In this plenary session, Accenture Big Data practice lead Vince Dell’Anno and Accenture senior big data architect TJ Glazier show you how managers see competing technologies and how you can help them cut through the noise.


Speakers

TJ Glazier

Senior Big Data Architect, Accenture


Tuesday June 11, 2013 9:30am - 10:20am
Festival Pavilion Fort Mason, San Francisco, CA

10:20am

Break
Tuesday June 11, 2013 10:20am - 10:50am
Fort Mason Fort Mason, San Francisco, CA

10:50am

Big Architectures for Big Data
Having many different technologies within an organization can be problematic for developers and operations alike. Structuring those systems into discrete modules not only abstracts away a lot of the complexity of a heterogeneous architecture, it also allows the evolution of systems using common access and storage patterns. This session will discuss how to think about, architect, and maintain a service architecture for a big data system.

Speakers

Eric Lubow

CTO & Co-Founder, SimpleReach
Eric Lubow, CTO of SimpleReach, began his career building secure systems at Guardian Digital before joining Conductor.com, where he developed and ran their ad systems. After graduating from Rutgers University with a degree in Information Systems, Eric did multiple tours with the U.S. Army including serving in Iraq as a Lieutenant and Albania as a Captain. He is a board advisor for the IronMatt.org charity for pediatric brain tumors, a mixed...


Tuesday June 11, 2013 10:50am - 11:20am
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

10:50am

Crossing the Chasm: SQL to NoSQL
Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Come hear the "Why?", "What for?" and "How?" of that evolution.

Speakers

Tuesday June 11, 2013 10:50am - 11:20am
Firehouse Building Fort Mason, San Francisco, CA

10:50am

Netflix Open Source Tools and Benchmarks for Cassandra
Netflix has updated and added new tools and benchmarks for Cassandra in the last year. In this talk we will cover the latest additions and recipes for the Astyanax Java client, updates to Priam to support Cassandra 1.2 vnodes, plus newly released and upcoming tools that are all part of the NetflixOSS platform. Following on from the Cassandra on SSD on AWS benchmark that was run live during the 2012 Summit, we've been benchmarking a large write-intensive multi-region cluster to see how far we can push it. Cassandra is the data storage and global replication foundation for the Cloud Native architecture that runs Netflix streaming for 36 million users. Netflix is also offering a Cloud Prize for open source contributions to NetflixOSS, and there are ten categories including Best Datastore Integration and Best Contribution to Performance Improvements, with $10K cash and $5K of AWS credits for each winner. We'd like to pay you to use our free software!

Speakers

Adrian Cockcroft

Cloud Architect, Netflix
Adrian Cockcroft is the director of architecture for the Cloud Systems team at Netflix. He is focused on availability, resilience, performance, and measurement of the Netflix cloud platform, and has presented at many conferences, including the Cassandra Summit, QCon/GOTO, Cloud Connect, Velocity, Gluecon and Structure. Adrian is also well known as the author of several books while a Distinguished Engineer at Sun Microsystems: Sun Performance...


Tuesday June 11, 2013 10:50am - 11:50am
Festival Pavilion Fort Mason, San Francisco, CA

10:50am

(Re)-Building the Social Grid for Global Telcos @ 1/10th the Market Cost
Darshan Rawal is leading the development of hybrid cloud-based messaging products for global Tier 1 Telcos. Darshan has been working in Silicon Valley since 2000, building nimble, cost-effective products and services that handle millions of users and billions of transactions per day. Prior to Openwave Messaging, Darshan held engineering positions at SS8 Networks, Yahoo, DE Shaw, and yp.com, and he has an M.S. in Software Engineering from Carnegie Mellon University.

Speakers

Darshan Rawal

VP of Engineering, Openwave Messaging


Tuesday June 11, 2013 10:50am - 11:50am
Marina Room Conference Center Fort Mason, San Francisco, CA

10:50am

The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
NativeX (formerly W3i) recently transitioned a large portion of their backend infrastructure from Microsoft SQL Server to Apache Cassandra. Today, its Cassandra cluster backs its mobile advertising network supporting over 10 million daily active users that produce over 10,000 transactions per second with an average database request latency of under 2 milliseconds. Come hear our story about how we were successful at getting our .NET web apps to reliably connect to Cassandra. Come learn about FluentCassandra, Snowflake, Hector, and IKVM. It's a story of struggle and perseverance, where everyone lives happily ever after.

Speakers

Derek Bromenshenkel

Infrastructure Architect, NativeX
Derek Bromenshenkel is an Infrastructure Architect at NativeX with over 6 years of software development experience. Recently he's helped NativeX's Mobile Business become more scalable and resilient by utilizing Cassandra for high volume data persistence.

Jeff Smoley

Infrastructure Architect, NativeX
Jeff Smoley is an Infrastructure Architect at NativeX with over 13 years of software development experience ranging from VB6 desktop apps to high-transaction ASP.NET/WCF web applications. His focus is on building resilient and maintainable applications. Most recently he's helped NativeX's Mobile Business become more scalable and resilient by utilizing Cassandra for high volume data persistence.


Tuesday June 11, 2013 10:50am - 11:50am
Fleet Room - Building D Fort Mason, San Francisco, CA

10:50am

DataStax Certification
Tuesday June 11, 2013 10:50am - 12:20pm
Building C370 Fort Mason, San Francisco, CA

11:20am

Ground Traffic Control - Logistics with Cassandra
Come learn how Zonar Systems uses Cassandra for logistics use cases such as tracking fleets of school buses and other fleet management services. Zonar uses Cassandra because of its ability to scale horizontally, its continuous availability and its operational ease. This talk will cover details about the implementation and the 3-year journey that got us here, including the challenges along the way.

Speakers

Jesse Young

Director of Research, Zonar Systems
While completing my bachelor's degree in computer engineering, I was hired at Zonar Systems as an Electronics Technician. Zonar allowed me to attend classes in the morning while working in the afternoons and evenings. Working here, I was able to branch out and collaborate with the different divisions in the company. After only 3 months, I was promoted to System Administrator where I learned more about the internal workings of the Zonar system...


Tuesday June 11, 2013 11:20am - 11:50am
Firehouse Building Fort Mason, San Francisco, CA

11:20am

No, Really. These Companies Transformed Their Businesses Using Cassandra
During this panel discussion, IT executives from Walmart, Constant Contact, Ooyala and HealthCare Anytime will discuss how they are transforming their businesses using the power of big data. The panelists will discuss how the increased expectations of customers and the evolving role of IT caused each of them to lead initiatives designed to change their business, the technical challenges they faced and how DataStax Enterprise helped them achieve their goals. Lara Shackelford, vice president of marketing at DataStax, will lead the discussion.

Speakers

Terrell Deppe

CTO, HealthCare Anytime
Terrell is the Chief Technology Officer of HealthCare Anytime. Terrell has nearly 30 years of professional software development experience. Nearly half of his career has been spent in healthcare, where he has addressed the needs of patients, payers and providers.

Sean Knapp

Executive Vice President and Chief Product Officer, Ooyala
Sean Knapp is a co-founder of Ooyala. As Executive Vice President and Chief Product Officer, he oversees all product and engineering and helps to define and execute Ooyala's product vision. In his previous role as Ooyala's CTO, Sean led the development of Ooyala's award-winning scalable, efficient and powerful video solutions. As CTO, he shaped Ooyala's technical strategy and roadmap and led technical collaborations with partners, earning...

Stephen O'Sullivan

Architect, Silicon Valley Data Science

Stefan Piesche

Chief Technology Officer, Constant Contact
As Chief Technology Officer, Piesche focuses on evaluating new technologies to determine how they can help expand Constant Contact's products and service architecture to accommodate future growth and increased performance. To do this, he combines technology vision with practical delivery capabilities and engineering best practices. Prior to Constant Contact, Piesche spent eight years at the Cobalt Group, where he was executive director of...


Tuesday June 11, 2013 11:20am - 12:20pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

11:50am

Big Data Analytics - Realize the Investment from Your Big Data Clusters
The term "big data" seems to be everywhere these days. With the ever growing number of attendees at big data and Hadoop events, it's clear big data is here to stay. But what does that mean for the analytics market, and how does big data fit into the picture? This session, featuring Mark Davis, Sr. Product Architect at Dell, will explore what big data means in a practical sense to the IT department. It will also explore the many ways that big data affects an organization's picture of performance. Plus, see how big data analytics, using technologies like Cassandra and Hadoop, will converge with traditional business intelligence to create a complete picture of the enterprise's information assets, thereby giving the business a complete and insightful view of its operational efficiency.

Speakers

Mark Davis

Principal Engineer, Dell
Mark Davis is the founder of Kitenga Inc, acquired by Dell Software in 2012, and serves as its Chief Technology Officer. Mr. Davis has 19 years of experience in computational linguistics, information retrieval, business development in text analytics, information access and related areas. He founded Inventix, was also a DoD/Intelligence Community researcher at Computing Research Lab in New Mexico for 7 years and was author or co-author on over 20 papers...


Tuesday June 11, 2013 11:50am - 12:20pm
Festival Pavilion Fort Mason, San Francisco, CA

11:50am

Real-time Analytics using Cassandra, Spark and Shark
This session covers our experience with using the Spark and Shark frameworks for running real-time queries on top of Cassandra data. We will start by surveying the current Cassandra analytics landscape, including Hadoop and Hive, and touch on the use of custom input formats to extract data from Cassandra. We will then dive into Spark and Shark, two memory-based cluster computing frameworks, and how they enable often dramatic improvements in query speed and productivity over today's standard solutions.

Speakers

Evan Chan

Senior Software Engineer, Ooyala


Tuesday June 11, 2013 11:50am - 12:20pm
Fleet Room - Building D Fort Mason, San Francisco, CA

11:50am

Time-Series Metrics with Cassandra
Librato's Metrics platform relies on Cassandra as its sole data storage platform for time-series data. This session will discuss how we have scaled from a single six node Cassandra ring two years ago to the multiple storage rings that handle over 150,000 writes/second today. We'll cover the steps we have taken to scale the platform including the evolution of our underlying schema, operational tricks, and client-library improvements. The session will finish with our suggestions on how we believe Cassandra as a project and its community can be improved.

Speakers

Mike Heffner

Engineer & Co-Founder, Librato
Mike Heffner works at Librato ensuring their metrics platform is scalable and performant. At Librato he spends his time wrangling their Cassandra datastore, building distributed backend systems for measurement processing and assisting on development of various API components. Mike enjoys contributing to open source and has been active in the startup and academic environments for many years.


Tuesday June 11, 2013 11:50am - 12:20pm
Marina Room Conference Center Fort Mason, San Francisco, CA

11:50am

Time for a New Relationship - Intuit's Journey from RDBMS to Cassandra
This session covers the journey of Intuit’s Consumer Financial Platform, which is built to scale to petabytes of data. The original system used a major RDBMS; from there, we redesigned it to take advantage of the distributed nature of Cassandra. This talk will go through our transition, including the data model used for the final product. As with any large system transition, many hard lessons were learned, and we will discuss those and share our experiences.

Speakers

Tuesday June 11, 2013 11:50am - 12:20pm
Firehouse Building Fort Mason, San Francisco, CA

12:30pm

Lunch
Tuesday June 11, 2013 12:30pm - 1:30pm
Fort Mason Fort Mason, San Francisco, CA

1:30pm

Eventual Consistency != Hopeful Consistency
This session will address Cassandra's tunable consistency model and cover how developers and companies should adopt a more Optimistic Software Design model.

Speakers

Christos Kalantzis

Engineering Manager of Cloud Persistence Engineering, Netflix
A humanitarian at heart, I try to focus my efforts on creating technology that enhances our lives. I have built and led teams at YouSendIt and Netflix to scale out their persistence layers, create a cloud file system and adopt Apache Cassandra as a scalable and highly available data solution. I've worked as a DB2, SQL Server and MySQL DBA for over 10 years and, through sometimes painful trial and error, I have learned the advantages and...


Tuesday June 11, 2013 1:30pm - 2:00pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

1:30pm

High Throughput Analytics with Cassandra
Building analytics systems is an increasingly common requirement for BI teams inside companies both big and small, and a feat made even more challenging when analytic results have to be produced in real-time. In this presentation the team from MarkedUp Analytics will show you techniques for leveraging Cassandra, Hadoop, and Hive to build a manageable and scalable analytics system capable of handling a wide range of business cases and needs.

Speakers

Tuesday June 11, 2013 1:30pm - 2:00pm
Marina Room Conference Center Fort Mason, San Francisco, CA

1:30pm

Stop Crippling Your Business: Fundamental Considerations Everyone Needs to Know
All too often, the wrong standardization and/or corporate bureaucracy can absolutely undermine the success of a new project. In this panel discussion we will explore the necessary requirements and flexibility needed to efficiently handle all aspects of your big data infrastructure: from hardware, to software, to implementation strategy. Mistakes in any one of these categories can fundamentally limit your ability to execute, leading to tremendous cost and missed opportunity.

Moderators

Billy Bosworth

CEO, DataStax
Billy is responsible for the strategy, explosive growth, and day-to-day operations of DataStax. He has 20 years of experience in the database industry in roles ranging from DBA to senior executive. Prior to DataStax, Billy spent 6 years at Quest Software, a provider of systems management software, where his most recent role was VP and GM of the database business unit. Under his leadership, the industry-leading Quest database business grew from...

Speakers

Tuesday June 11, 2013 1:30pm - 2:30pm
Firehouse Building Fort Mason, San Francisco, CA

1:30pm

Optimizing the Public Cloud for Cost and Scalability with Cassandra - The MetricsHub Story
MetricsHub is a monitoring and scalability service for public clouds, allowing companies to continuously gather data from their systems and auto-scale their deployments to optimize service costs. Taking advantage of Cassandra's rapid ingestion rates, reliable replication model, and ease of deployment, MetricsHub can handle billions of datapoints per day. During this session, you will learn about the architecture supporting this service, which combines the power of PaaS and IaaS on the Windows Azure platform.

Speakers

Charles Lamanna

Founder and Developer Lead, Metricshub, Microsoft
Charles currently works on the Windows Azure monitoring team to define the next generation of cloud monitoring and management. Charles was responsible for technical and business areas at MetricsHub. He was a member of the founding team and developed the company from idea stage, to revenue and then to exit. MetricsHub was acquired by Microsoft on March 4th, 2013. The premium MetricsHub product was offered as a no charge service following the...


Tuesday June 11, 2013 1:30pm - 2:30pm
Fleet Room - Building D Fort Mason, San Francisco, CA

1:30pm

Suicide Risk Prediction Using Social Media and Cassandra
In this presentation, Ken will describe a portion of an early-phase project that uses social media data (tweets, Facebook posts, etc.) from service personnel to predict suicide rates. There's a lot of motivation to provide better data for military psychologists, since more military personnel wind up taking their own lives than are killed in the line of duty. By combining social media data that is voluntarily provided by personnel with a predictive analytics system, we can provide assessments that help mental health workers focus their time and energy on the most at-risk individuals. This project uses Cassandra as the scalable storage system for this social media data, which is then analyzed in a distributed environment using Hadoop. The project also uses the Solr search support from DataStax Enterprise to provide ways for users to dig into the underlying data, which is critical for understanding the assigned risk levels.

Speakers

Ken Krugler

Big Data Consulting, Scale Unlimited
Ken is the President of Scale Unlimited, a consulting and training company for big data processing and web mining problems, using Hadoop, Cascading, and Solr.


Tuesday June 11, 2013 1:30pm - 2:30pm
Festival Pavilion Fort Mason, San Francisco, CA

1:30pm

Meet the Experts
Tuesday June 11, 2013 1:30pm - 4:30pm
Building C370 Fort Mason, San Francisco, CA

2:00pm

Lock it Up: Securing Sensitive Data
As adoption of NoSQL solutions like Apache Cassandra grows, so too does the likelihood that organizations will use it to capture and analyze sensitive data. Enterprises that don't take every precaution to protect this data leave themselves exposed to the risk of a data breach and, depending on the regulatory nature of the data, fines for noncompliance. This session will discuss how transparent data encryption and advanced key management protect data at rest and in flight, so regardless of where the data resides, either on premises or in the cloud, it remains garbled and unreadable to all people, processes and applications that don't require immediate access. The session will also cover DevOps automation tools that ensure rapid distributed deployment of big data security across thousands of nodes.

Speakers

Sam Heywood

Sr. Director of Products, Gazzang
Sam Heywood is senior director of products at Gazzang, responsible for direction of the company's data security and big data solutions. He is a technology industry veteran and strong proponent of the cloud as a platform for developing and delivering enterprise-business solutions. Prior to joining Gazzang, Sam was responsible for driving product roadmap, planning and delivery processes at UShip. He has also held product and marketing management...


Tuesday June 11, 2013 2:00pm - 2:30pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

2:00pm

Deciding Dollars: It's Actually Actuarial
In today's world, data is growing faster than ever. For online apps, two things matter more than anything else for the database: uptime and performance. The intersection between data growth and online requirements results in interesting technology choices. This panel will discuss the implications - and approaches - to maximize revenue via technology decisions.

Speakers

Rick Branson

Infrastructure Engineer, Instagram
Rick is an Infrastructure Engineer at Instagram where he spends his days building scalable systems to deal with their meteoric growth rates. Rick is a totally impatient dude, so he’s spent the last 10 years obsessed with making complex production systems faster and more reliable. Previously he was a Solutions Architect at DataStax, and has held various engineering roles at Coroutine, Mosaik Solutions, and FedEx.

Adrian Cockcroft

Cloud Architect, Netflix
Adrian Cockcroft is the director of architecture for the Cloud Systems team at Netflix. He is focused on availability, resilience, performance, and measurement of the Netflix cloud platform, and has presented at many conferences, including the Cassandra Summit, QCon/GOTO, Cloud Connect, Velocity, Gluecon and Structure. Adrian is also well known as the author of several books while a Distinguished Engineer at Sun Microsystems: Sun Performance...


Tuesday June 11, 2013 2:00pm - 2:30pm
Marina Room Conference Center Fort Mason, San Francisco, CA

2:30pm

It's Like Your Parents: Relational and NoSQL can co-exist
25% of the world's online video traffic is analyzed by Ooyala. So it's a good thing that when Sean founded the company, he didn't worry about pointless marketing battles between this or that technology. Instead, he set out to solve real problems with no pre-conceived biases. In this session, you will learn how he ended up with a blend of NoSQL, relational, and Hadoop; each playing their part in a new era of application design.

Speakers

Sean Knapp

Executive Vice President and Chief Product Officer, Ooyala
Sean Knapp is a co-founder of Ooyala. As Executive Vice President and Chief Product Officer, he oversees all product and engineering and helps to define and execute Ooyala's product vision. In his previous role as Ooyala's CTO, Sean led the development of Ooyala's award-winning scalable, efficient and powerful video solutions. As CTO, he shaped Ooyala's technical strategy and roadmap and led technical collaborations with partners, earning...


Tuesday June 11, 2013 2:30pm - 3:00pm
Firehouse Building Fort Mason, San Francisco, CA

2:30pm

Cassandra and the Titan Graph Database
This presentation introduces Titan, Faunus, and scalable graph computing in general. We present a case study of how Pearson builds an education social network on top of Titan, Faunus, and Cassandra to support learning in the 21st century. Titan is an open source distributed graph database built on top of Cassandra that can power real-time applications with thousands of concurrent users over graphs with billions of edges. Faunus is an open source global graph processing engine built on top of Hadoop and compatible with Cassandra that can analyze graphs, compute graph statistics, and execute global traversals. Titan and Faunus are components of the Aurelius Graph Cluster, which enables scalable graph computation and powers applications in social networking, recommendation engines, advertisement optimization, knowledge representation, health care, education, and security.

Speakers

Matthias Broecheler

CTO, Aurelius
Matthias Broecheler is the lead developer of the distributed graph database Titan and has researched large scale graph database systems for more than 5 years. His award-winning research includes high performance index structures and query answering algorithms for graph structured data. In addition, he developed the Probabilistic Similarity Logic (PSL) machine learning framework to analyze and reason about multi-relational data. Matthias holds a...


Tuesday June 11, 2013 2:30pm - 3:00pm
Festival Pavilion Fort Mason, San Francisco, CA

2:30pm

Data Modelers Still Have Jobs: Adjusting For the NoSQL Environment
Using concrete, real-world examples, the presenter will show the following: how abandoning modeling altogether is a recipe for disaster, even in, or especially in, NoSQL environments; how experienced relational modelers can leverage their skills for NoSQL projects; how the NoSQL context both simplifies and complicates the modeling endeavor; and how lessons learned modeling for NoSQL projects can make you a more effective modeler for any kind of project.

Speakers

Joe Maguire

Founder, Data Quality Strategies, LLC
Now in his fourth decade in the computer industry, Joe Maguire is an analyst and consultant specializing in data management and requirements analysis. His hard-won perspective is informed by broad experience including twelve years in product development for software vendors (Digital, Lotus, Microsoft, Bachman Information Systems); thirteen years as a consulting modeler, requirements analyst, and information architect for clients (ranging from...


Tuesday June 11, 2013 2:30pm - 3:00pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

2:30pm

Virtual Nodes: Rethinking Topology in Cassandra
A discussion of the recent work to transition Cassandra from its naive 1-partition-per-node distribution, to a proper virtual nodes implementation.

Speakers

Eric Evans

Apache Cassandra Committer and Chief Architect, OpenNMS
Eric has more than a decade of experience in large-scale distributed systems, having held roles in both operations and engineering. An early employee of Rackspace, he implemented a global DNS infrastructure utilizing IP anycast (possibly the first), and a novel data-center-wide IDS for which a patent was awarded. An avid open source hacker, Eric is a developer with the Debian Project and a member of the Apache Cassandra PMC. He resides in Texas...


Tuesday June 11, 2013 2:30pm - 3:00pm
Fleet Room - Building D Fort Mason, San Francisco, CA

2:30pm

Hardware Agnostic: Cassandra on Raspberry Pi
The Raspberry Pi is a credit-card sized $25 ARM-based Linux box designed to teach children the basics of programming. The machine comes with a 700MHz ARM processor and 512MB of memory and boots off an SD card, which is not much power for running the likes of a Cassandra cluster. This presentation will discuss the problems of getting Cassandra up and running on the Pi and will answer the all-important question: Why on Earth would you want to do this!?

Speakers

Andy Cobley

Lecturer, University of Dundee
Andy Cobley is a lecturer at the University of Dundee in Big Data, Business Intelligence and Internet programming.


Tuesday June 11, 2013 2:30pm - 3:00pm
Marina Room Conference Center Fort Mason, San Francisco, CA

3:00pm

Break
Tuesday June 11, 2013 3:00pm - 3:30pm
Fort Mason Fort Mason, San Francisco, CA

3:30pm

Processing an Avalanche of Medical Records
HealthCare Anytime provides Web-based portal solutions that assist healthcare organizations in achieving meaningful use, optimized operations, and increased patient and staff satisfaction. During this speaking session, HealthCare Anytime CTO Terrell Deppe will discuss the challenges his company faced when processing an "avalanche" of patient records and how he utilized DataStax's Cassandra-based big data platform to improve their product's performance while reducing costs.

Speakers

Terrell Deppe

CTO, HealthCare Anytime
Terrell is the Chief Technology Officer of HealthCare Anytime. Terrell has nearly 30 years of professional software development experience. Nearly half of his career has been spent in healthcare, where he has addressed the needs of patients, payers and providers.


Tuesday June 11, 2013 3:30pm - 4:00pm
Festival Pavilion Fort Mason, San Francisco, CA

3:30pm

Does Big Data Drive Apps, or Do Apps Drive Big Data?
Godfrey Sullivan has seen it all when it comes to the technology world. Especially when it comes to line of business applications, big data, and big ideas. From his early days at Apple, to CEO positions at Hyperion and Splunk, to board seats at Citrix and Informatica, Godfrey has developed an amazing understanding of the business value and potential in data. In this discussion, we will explore the drivers for big data. What comes first? The applications, the data, or the customers?

Speakers

Godfrey Sullivan

CEO, Splunk
Godfrey R. Sullivan has served as Splunk's President, Chief Executive Officer and a member of the board of directors since 2008, and as the Chairman since 2011. Prior to joining Splunk, Mr. Sullivan was with Hyperion Solutions Corporation, a performance management software company acquired by Oracle Corporation, from 2001 to 2007, where he served in various executive roles, most recently as President and Chief Executive Officer, and as a member...


Tuesday June 11, 2013 3:30pm - 4:30pm
Firehouse Building Fort Mason, San Francisco, CA

3:30pm

Taking Bytes from Cassandra Clients
Since early 2012 Gora has been proudly participating as an honorary Incubator post-grad within the ASF. This presentation provides Renato and Lewis' perspective on a phenomenon they refer to as the "big datastore client wars", which is a real-life challenge they've discovered whilst attempting to integrate several big data backends (Accumulo, Cassandra, HBase, MySQL, HSQLDB, Amazon's DynamoDB, MongoDB) under one common persistence layer and, in the process, obtain optimal results over Gora operations. They emphasize their approach to addressing this problem by discussing a pluggable Cassandra client infrastructure (Hector-client, DataStax Java driver, intravert-ug, etc.) adapted specifically for the gora-cassandra module.

Speakers
LJ

Lewis John McGibbney

Post Doctoral, Stanford University
Legislative Informatics PhD. from Glasgow Caledonian University, Glasgow, Scotland. Currently a Post Doctoral Research Scholar within Civil and Environmental Engineering at Stanford University. Member of the Apache Software Foundation, V.P Apache Gora, PMC Apache Nutch, PMC Apache Any23. Previous mentor in Google Summer of Code 2012 program. OASIS Legal Document ML TC member. Lewis currently participates in several open source projects and... Read More →


Tuesday June 11, 2013 3:30pm - 4:30pm
Marina Room Conference Center Fort Mason, San Francisco, CA

3:30pm

When Bad Things Happen to Good Data: A Deep Dive Into How Cassandra Resolves Inconsistent Data
This talk focuses on Cassandra's anti-entropy mechanisms. Jason will discuss the details of read repair, hinted handoff, node repair, and more as they aid in resolving data that has become inconsistent across nodes. In addition, he'll provide insight into how those techniques are used to ensure data consistency at Netflix.

Speakers
JB

Jason Brown

Senior Software Engineer and Apache Cassandra Committer, Netflix
Jason Brown is a Senior Software Engineer at Netflix, as well as being an Apache Cassandra Committer. He holds a Master’s degree in Music Composition and is searching for time to write a second string quartet.


Tuesday June 11, 2013 3:30pm - 4:30pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

3:30pm

Dude, Where's My Tweet? Taming the Twitter Firehose
Gnip ingests and must serve out hundreds of millions of social activities every day, and social platforms are only growing. This makes the scalability of applications essential for Gnip. Enter Cassandra. Problem solved, right? Not exactly. Gnip's relationship with Cassandra was not all rainbows and unicorns. In this session we will walk you through why we began looking at Cassandra as a data store in the first place and the valuable lessons we learned with Cassandra that have made it an invaluable part of our infrastructure.

Speakers
AN

Andrew Noonan

Software Engineer, Gnip
Andrew is a software engineer with Gnip in Boulder, Colorado. He has helped the high-powered engineering team at Gnip do real-time filtering and provide historical access to the entire Twitter corpus for a little over a year now. Andrew was part of the team that first began stress testing Cassandra with the large amounts of data that they see and experienced firsthand the fact that it is not always rainbows and unicorns when it comes to shinny... Read More →


Tuesday June 11, 2013 3:30pm - 4:30pm
Fleet Room - Building D Fort Mason, San Francisco, CA

4:00pm

Real World, Real Time Data Modeling
Data modeling for Cassandra presents a new set of challenges, especially for developers with a background in relational data modeling. There are added complexities in modeling for analytic applications which need to enable statistical functions over the data, but a good data model, exploiting Cassandra's strengths, can make all the difference to a successful project. This tutorial will examine a number of real-world customer data modeling examples and draw out some hints and tips that will benefit not just the Cassandra newbie, but also the more experienced data modeler.

Speakers
TM

Tim Moreton

CTO, Acunu
As a co-founder and CTO of Acunu Tim was instrumental in delivering Acunu Analytics, the leading Cassandra-based operational intelligence platform. Tim was previously a senior member of the technical team at Tideway (now BMC), where he led the creation of solutions for managing data centers at Fortune 500 clients. Previously he was CEO of a consultancy delivering data solutions for the aviation sector. He holds a PhD in Computer Science from... Read More →


Tuesday June 11, 2013 4:00pm - 4:30pm
Festival Pavilion Fort Mason, San Francisco, CA

5:15pm

The Next Great Data Developer Contest
Tuesday June 11, 2013 5:15pm - 6:15pm
Festival Pavilion Fort Mason, San Francisco, CA

6:15pm

Happy Hour
Tuesday June 11, 2013 6:15pm - 6:30pm
Festival Pavilion Fort Mason, San Francisco, CA
 
Wednesday, June 12
 

7:00am

Open Registration
Wednesday June 12, 2013 7:00am - 9:00am
Festival Pavilion Fort Mason, San Francisco, CA

7:45am

Networking Breakfast
Wednesday June 12, 2013 7:45am - 9:15am
Festival Pavilion Fort Mason, San Francisco, CA

9:30am

DataStax Certification
Wednesday June 12, 2013 9:30am - 10:30am
Building C370 Fort Mason, San Francisco, CA

9:30am

Taking Risks Without Risking Your Career
Let's face it, these key technology decisions are about way more than just technology. The stakes are high, and your career is on the line. Play it safe, and you could become irrelevant. But taking risks is risky business. These two choices are always in tension, and nobody understands this balance better than Netflix, a company with a history of bold, calculated risk taking.

Speakers
BB

Billy Bosworth

CEO, DataStax
Billy is responsible for the strategy, explosive growth, and day-to-day operations of DataStax. He has 20 years of experience in the database industry in roles ranging from DBA to senior executive. Prior to DataStax, Billy spent 6 years at Quest Software, a provider of systems management software, where his most recent role was VP and GM of the database business unit. Under his leadership, the industry-leading Quest database business grew from... Read More →
CK

Christos Kalantzis

Engineering Manager of Cloud Persistence Engineering, Netflix
A humanitarian at heart, I try to focus my efforts in creating technology that enhances our lives. I have built and led teams at YouSendIt and Netflix to scale out their persistence layers, create a cloud file system and to adopt Apache Cassandra as a scalable and highly available data solution. I've worked as a DB2, SQL Server and MySQL DBA for over 10 years and through, sometimes painful, trial and error I have learned the advantages and... Read More →


Wednesday June 12, 2013 9:30am - 10:30am
Firehouse Building Fort Mason, San Francisco, CA

9:30am

Large Queries in Real-Time for Enterprise
The ability to manipulate and query very large datasets in realtime is a pressing need for most large data enterprises. Recently, we've seen an explosion of tools such as Impala or Druid, but all of these tools suffer from single points of failure or can't deliver the sub 1 second query times necessary for realtime results. Together we'll explore how to break down these seemingly intractable problems. We'll learn how to build horizontally scalable query engines with Cassandra, capable of sub-second query times across multi-billion row datasets.

Speakers
MS

Matt Stump

Senior Backend Engineer, KISSMetrics
Senior Backend Engineer for KISSMetrics, DataStax MVP 2012, and author of the forthcoming "Cassandra: A Beginner's Guide" published by Packt.


Wednesday June 12, 2013 9:30am - 10:30am
Marina Room Conference Center Fort Mason, San Francisco, CA

9:30am

No moving parts. Taking advantage of pure speed.
Flash Memory technology, deployed as server-side PCIe or solid state disks (SSDs), is emerging as a critical tool for performance and efficiency in data centers of all scales. This presentation will discuss how the use of Flash impacts Cassandra deployments in terms of configuration, DRAM requirements and performance expectations. Ideas on leveraging C*'s cutting-edge data-center awareness to blend flash and disk storage nodes for cost and workload efficiency will also be shared. Flash media itself will be examined from a physical perspective to understand endurance issues. Data on write amplification under bulk-load and operational workload conditions will be presented to explain the impact to Flash of C*'s Log Structured Merge Tree architecture and the associated compactions. Finally, we will examine strategies to make Cassandra more Flash-aware using both conventional techniques as well as emerging Non-volatile memory (NVM) programming capabilities. Lessons learned from real-world customer deployments will be shared to complete this presentation.

Speakers
MK

Matt Kennedy

Big Data Solutions Architect, Fusion-IO
Matt Kennedy is a Solutions Architect at Fusion-IO. He works with customers to apply the performance benefits of low-latency server-side Flash technology to NoSQL databases, Hadoop clusters and other Big Data technologies. Matt has been a Cassandra user and occasional contributor since version 0.7 and is a co-organizer of the Cassandra meetup group in the Washington DC area.


Wednesday June 12, 2013 9:30am - 10:30am
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

9:30am

Hindsight is 20/20. MySQL to Cassandra
A brief intro to how Barracuda Networks uses Cassandra and the ways in which they are replacing their MySQL infrastructure with Cassandra. This presentation will include the lessons they've learned along the way during this migration.

Speakers
MK

Michael Kjellman

Software Engineer, Barracuda Networks
Michael Kjellman is a Software Engineer, from San Francisco, working at Barracuda Networks. Michael works across multiple products, technologies, and languages. He primarily works on Barracuda's spam infrastructure and web filter classification data.


Wednesday June 12, 2013 9:30am - 10:30am
Festival Pavilion Fort Mason, San Francisco, CA

9:30am

No Whistling Required: Cabs, Cassandra, and Hailo
Hailo has leveraged Cassandra to build one of the most successful startups in European history. This presentation looks at how Hailo grew from a simple MySQL-backed infrastructure to a resilient Cassandra-backed system running in three data centres globally. Topics covered include: the process of migration, experience running multi-DC on AWS, common data modeling patterns and security implications for achieving PCI compliance.

Speakers
DG

Dave Gardner

Senior Engineer, Hailo
Dave Gardner is part of the services team at Hailo - the taxi app company. He has helped build the Hailo platform that brings together taxi drivers and passengers in 6 cities across the globe and counting, focussing on availability and resilience. Dave has worked with Apache Cassandra since 2010 and is the founder of the Cassandra London user group.


Wednesday June 12, 2013 9:30am - 10:30am
Fleet Room - Building D Fort Mason, San Francisco, CA

10:30am

Break
Wednesday June 12, 2013 10:30am - 11:00am
Fort Mason Fort Mason, San Francisco, CA

11:00am

Remember Me! Session Clustering with Cassandra
In this session Les Hazlewood, the Apache Shiro PMC Chair, will cover Shiro's enterprise session management capabilities, how it can be used across any application (not just web or JEE applications) and how to use Cassandra as Shiro's session store, enabling a distributed session cluster supporting hundreds of thousands or even millions of concurrent sessions. As a working example, Les will show how to set up a session cluster in under 10 minutes using Cassandra. If you need to scale user session load, you won't want to miss this!

Speakers
LH

Les Hazlewood

Co-Founder & CTO and Apache Shiro PMC Chair, Stormpath
Prior to forming Stormpath, Les held senior architectural positions at Bloomberg and Delta Airlines and he was former CTO of a software engineering firm supporting educational and government agencies. Les has been actively involved in Open Source for more than 10 years, committing or contributing to projects like the Spring Framework, JBoss, and Apache Shiro. Les has a BS in Computer Science from Georgia Tech, and practices Kendo and studies... Read More →


Wednesday June 12, 2013 11:00am - 11:30am
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

11:00am

DataStax Certification
Wednesday June 12, 2013 11:00am - 12:00pm
Building C370 Fort Mason, San Francisco, CA

11:00am

Are You Flexible Enough? Thoughts on the Future of Big Data and the Cloud
Is there a "right" role for the cloud in your company's future? Are there too many risks and costs associated with public clouds? Is hybridization of on premises and cloud a viable strategy? In this session, we will explore the pros and cons of cloud architecture as it relates to your big data projects.

Speakers
GS

Gurvinder Singh Ahluwalia

CTO for Cloud Computing, IBM
Gurvinder Singh Ahluwalia is the CTO for Cloud Computing across LOB's at IBM covering the West US market. In this role, he provides strategy and architecture leadership in the field to customers, represents 60-70% of the cloud portfolio at IBM, and creates heterogeneous, open source, cloud based solutions and business models for clients. During the Distributed Computing era, Guri was immersed in design-deploy-run of the second largest routed network in the... Read More →
BS

Biri Singh

Technology Thought Leader, N/A
Zorawar Biri Singh was recently SVP and GM, HP Converged Cloud and HP Cloud Services. In this role, he oversaw HP's global cloud portfolio & strategy and held overall responsibility for HP public cloud infrastructure, platform services, cloud solutions and cloud ecosystem. Singh has 22 years of global leadership experience as an executive and serial entrepreneur in enterprise IT, online advertising, e-commerce and communications. He... Read More →


Wednesday June 12, 2013 11:00am - 12:00pm
Firehouse Building Fort Mason, San Francisco, CA

11:00am

Practice Makes Perfect: Extreme Cassandra Optimization
Ooyala has been using Apache Cassandra since version 0.4. Our data ingest volume has exploded since 0.4 and Cassandra has scaled along with us. Al will cover many topics from an operational perspective on how to manage, tune, and scale Cassandra in a production environment.

Speakers
AT

Albert Tobey

Tech Lead, Compute and Data Services, Ooyala
Al Tobey is Tech Lead of the Compute and Data services team at Ooyala. His team develops and operates Ooyala's internal big data platform, consisting of Apache Cassandra, Hadoop, and internally developed tools. When not in front of a computer, Al is a father, husband, and trombonist.


Wednesday June 12, 2013 11:00am - 12:00pm
Fleet Room - Building D Fort Mason, San Francisco, CA

11:00am

The World's Next Top Data Model
You know you need Cassandra for its uptime and scaling, but what about that data model? Let's bridge that gap and get you building your game changing app. We'll break down topics like storing objects and indexing for fast retrieval. You will see that by understanding a few things about Cassandra internals, you can put your data model in the spotlight. The goal of this talk is to get you comfortable working with data in Cassandra throughout the application lifecycle. What are you waiting for? The cameras are waiting!

Speakers
PM

Patrick McFadin

Principal Solutions Architect, DataStax
Patrick McFadin is a Principal Solutions Architect at DataStax, the commercial company behind Apache Cassandra. Prior to his work at DataStax he was the Chief Architect at Hobsons, an education services company. He obtained a BS in Computer Engineering from Cal Poly, San Luis Obispo and holds the distinction of being the only recipient of a medal (as anyone can find out) for hacking while serving in the US Navy.


Wednesday June 12, 2013 11:00am - 12:00pm
Festival Pavilion Fort Mason, San Francisco, CA

11:00am

Cassandra in Cloud Foundry
Speakers
RK

Renat Khasanshyn

Founder and CEO, Altoros
Renat Khasanshyn is CEO of Altoros Systems and Venture Partner at Runa Capital. Employing one of the largest pools of NoSQL/Hadoop/Cloud Foundry engineering expertise on the market, Altoros provides vendor-independent professional services, consulting and integration to service providers, software vendors and enterprises. Mr. Khasanshyn is also founder of Apatar, an open source data integration toolset, founder of... Read More →


Wednesday June 12, 2013 11:00am - 12:00pm
Marina Room Conference Center Fort Mason, San Francisco, CA

11:30am

Adaptive Data Convergence for Life Sciences
The Life Sciences industry is undergoing significant changes in how companies do business due to recent legislative changes. The evolving landscape is forcing Pharmaceutical companies to change their entire Sales model and move from a Prescriber-based sales model to an Account-based sales model, which has a downstream impact on Sales team organization, Field sales alignment, Incentive compensation and Marketing. This changing reality also requires that Pharmaceutical companies understand and drive changes to business strategy based on insights from data about Prescription Sales, Medication Adherence and Claims, to name a few categories of data sources. The required insights are derived from the convergence of data from multiple sources that include numerous internal applications, 3rd party data sources and social media. In this session, learn how Reltio is helping various Pharmaceutical companies cope with the evolving business landscape with a data driven strategy by leveraging the Reltio data science engine that runs on Cassandra.

Speakers
MS

Manish Sood

CEO & Founder, Reltio
Manish is the founder and CEO at Reltio, a software startup that transforms related data into business opportunities by providing applications that are powered by the convergence of data from multiple data sources that include internal applications, 3rd party data feeds and social media. Reltio data convergence applications enable business users to understand markets, drive revenue and reduce risk through actionable, real-time and predictive... Read More →


Wednesday June 12, 2013 11:30am - 12:00pm
Marina Room Conference Center Fort Mason, San Francisco, CA

11:30am

Buy It Now! Cassandra at eBay
This session will cover use cases for Cassandra at eBay. It'll start with some simple logging & tracking use cases and move into a more complex use case called "eBay Social Signal", which enables like/own/want social-oriented features on eBay product and item pages. For each use case, Jay will cover in-depth data model design with trade-offs, deployment topology, and lessons learned. To conclude, Jay will summarize the best practices that guide Cassandra utilization at eBay.

Speakers
JP

Jay Patel

Technical Architect, eBay
Jay is a hands-on architect within Data Infrastructure & Services at eBay. He architects data storage behind eBay.com, which can scale up to hundred billion database calls every day. He is currently leading the evaluation of NoSQL and adoption of Cassandra for various eBay projects, and is developing NoSQL knowledge bases and best practices. In the not too distant past, he built the first version of eBay's cloud manager platform. He also... Read More →


Wednesday June 12, 2013 11:30am - 12:00pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

12:00pm

Lunch
Wednesday June 12, 2013 12:00pm - 1:10pm
Festival Pavilion Fort Mason, San Francisco, CA

1:10pm

Large Scale Data Ingestion, Processing and Analysis: Then, Now & Future
The presentation aims to highlight the challenges posed by large scale and near real-time data processing problems. In the past, such problems were solved using conventional technologies, primarily a database and a JMS queue. However, these solutions had their limits and presented serious problems in terms of scale and redundancy. The new breed of products, a la Cassandra & Kafka, being innately distributed in their design, aim to tackle such challenges in a very elegant manner. The presentation will showcase some of the use cases of this genre from the industry and describe the solutions, which have been increasing in their sophistication.

Speakers
AC

Ameet Chaubal

Technologist, Accenture
Ameet Chaubal is a technologist within Accenture's Emerging Technology practice. He has over 15 years of experience in all aspects of computing and lately has been focusing on solving big data storage and computation problems. He is interested in identifying the limits of conventional technology and adapting the bleeding edge solutions to fill the gaps.
FI

Fausto Inestroza

Architect, Accenture
Fausto Inestroza is an architect at the Accenture Technology Labs, the dedicated R&D organization of Accenture. His focus is on data platforms, analytical processes and distributed systems.


Wednesday June 12, 2013 1:10pm - 1:40pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

1:10pm

Searching for a Needle in a Big Data Haystack
The presentation demonstrates how Solr may be used to create real-time analytics applications. In addition, Datastax Enterprise 3.0 will be showcased, which offers Solr version 4.0 with a number of improvements over the previous DSE release. A realtime financial application will be run for the audience, followed by a detailed look at how the application was built. An overview of Datastax Enterprise Solr features will be given, along with how the many enhancements in DSE make it unique in the marketplace.

Speakers
JR

Jason Rutherglen

Senior Big Data Engineer, DataStax
Jason works at Datastax as a senior Big Data engineer architecting, developing and supporting the Datastax Enterprise product line which includes Solr integrated with Cassandra. His career has involved an array of technologies including search, Hadoop, Hive, mobile phones, cryptography, and natural language processing. Jason has been developing solutions with Lucene and Solr for more than 7 years and is a co-author of 'Programming Hive' and... Read More →


Wednesday June 12, 2013 1:10pm - 1:40pm
Fleet Room - Building D Fort Mason, San Francisco, CA

1:10pm

Data Driven Retail
How do you keep up with the velocity and variety of data streaming in from all the smart devices that run the physical environments of 7,000+ stores? What about getting analytics that tell you exactly where energy waste is happening in real-time? In this talk, Riptide IO describes their blueprint for collecting, organizing and deriving real-time operational intelligence from smart devices such as lighting, HVAC, sensors and more. Learn how this retailer gained a dramatic boost to their sustainability program, and solved some of the major bottlenecks in managing countless devices across thousands of stores.

Speakers
DL

David Leimbrock

CTO, Riptide IO
David is responsible for the technology vision and software engineering at Riptide IO. Dave is passionate about making software that customers love to use. He has spent the past 10 years leading engineering teams that are focused on communicating with and managing intelligent machines. Previously, Dave worked in the Emerging Technology Group at Cisco Systems - developing solutions for their Smart+Connected Buildings Group.


Wednesday June 12, 2013 1:10pm - 1:40pm
Marina Room Conference Center Fort Mason, San Francisco, CA

1:10pm

The Darwinism of Data Management Systems: the evolution required to support evolving data types
There is no denying that the world of data has changed dramatically. If your ecosystem does not evolve, you can end up in a barren wasteland. Conversely, making the right choices can lead to a future of bounty. Peter was one of the first analysts to see the potential impact of big data on traditional vendors. In this session, he will share insights that will help you evolve your business in fundamental and lasting ways.

Speakers
PG

Peter Goldmacher

Managing Director and Senior Research Analyst, Cowen and Company
Peter Goldmacher is a Managing Director and Senior Research Analyst. He joined Cowen in 2004 and has experience working in the computer industry as well as on Wall Street. He started his career at Oracle, working for six years in a variety of departments including sales ops, consulting, marketing, and finance, and he has also worked at BMC Software as Director, Corporate Planning and Strategy. Mr. Goldmacher had a brief stint at a start-up... Read More →


Wednesday June 12, 2013 1:10pm - 2:10pm
Firehouse Building Fort Mason, San Francisco, CA

1:10pm

Lightning Talks
John Wrobel, Director at SanDisk
Scaling Cassandra on SSDs

Yuki Morishita, Apache Cassandra Committer & Software Engineer at DataStax
How to Contribute to Cassandra

Nate McCall, Development Lead at Apigee
Adding Your Own Thrift Method in 5 Minutes

Yue Cathy Chang, Sr. Director of Business Development at Impetus
Impetus: Proven Practices in Leveraging Big Data's Competitive Advantage

Eyal Reuveni, Software Engineer at Eventbrite
Cassandra at Eventbrite

Joey Jablonski, Director of Product Management at Dell
Redefining Security for Big Data

Brian Hawkins, Senior Software Engineer at Proofpoint
KairosDB: Bob's Story

Joaquin Casares, Software Engineer at DataStax
Introduction to DataStax Enterprise

C. Scott Andreas, Engineer at Boundary

Praveen Kumar, Engineering Manager at Equinix
Equinix: Big Data Platform

Jeremy Hanna, Senior Support Engineer at DataStax
Troubleshooting Cassandra

Speakers
CS

C. Scott Andreas

Engineer, Boundary
JC

Joaquin Casares

Software Engineer, DataStax
YC

Yue Cathy Chang

Sr. Director of Business Development, Impetus
JH

Jeremy Hanna

Senior Support Engineer, DataStax
JJ

Joey Jablonski

Director of Product Management, Kitenga Analytics, Dell
Joey is Director of Product Management at Kitenga Analytics and is currently focused on the architecture and strategy for the deployment of complex analytic technologies including the Kitenga Analytic Suite. Joey leads Dell's strategy and product activities related to advanced analytics for unstructured data. Joey's technical interests include Big Data, high performance computing, low-latency networking technologies and information security. Joey... Read More →
PK

Praveen Kumar

Engineering Manager, Equinix
Praveen developed Equinix's big data strategy and is currently leading architecture and the roll-out of Equinix – Big Data Platform. Praveen has 14 years of experience in building high quality distributed software systems requiring high performance, scalability and short response time.
NM

Nate McCall

Development Lead, Apigee
YM

Yuki Morishita

Apache Cassandra Committer & Software Engineer, DataStax
ER

Eyal Reuveni

Software Engineer, Eventbrite
JW

John Wrobel

Director, SanDisk
John Wrobel joined SanDisk when the company acquired Schooner Information Technology. As director for enterprise solutions, he is instrumental in sales operations and market development activities. Prior to joining Schooner, he served as a software sales manager at IBM, where he was responsible for meeting the sales goals of the business through effective planning and budgeting. Wrobel holds a B.S. in Computer Engineering from Lehigh... Read More →


Wednesday June 12, 2013 1:10pm - 2:10pm
Festival Pavilion Fort Mason, San Francisco, CA

1:10pm

Meet the Experts
Wednesday June 12, 2013 1:10pm - 5:00pm
Building C370 Fort Mason, San Francisco, CA

1:40pm

Cassandra Internals
Speakers
AM

Aaron Morton

Apache Cassandra Committer, TheLastPickle
Aaron Morton is a Freelance Developer based in New Zealand, and a Committer on the Apache Cassandra project. In 2010 he gave up the RDBMS world for the scale and reliability of Cassandra. He now spends his time advancing the Cassandra project and helping others get the best out of it.


Wednesday June 12, 2013 1:40pm - 2:10pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

1:40pm

Data as Competitive Advantage in Manufacturing
Almost 10 years ago in a hotel room in Asia, Rich Hammel's first parser was born. That parser and its offspring have supported the development of world-class networking products at Brocade. This discussion will include how big data will change manufacturing, the essential ingredients for success in greenfield big data projects, and what it's like to be obsessed with quality.

Speakers
RH

Rich Hammel

Director of Advanced Manufacturing, Brocade Communications
Rich has spent the last 18 years leading engineering teams in the development of spacecraft, elevators, microfluidics, biotechnology and networking gear. Rich enjoys spending time teaching, coaching, roughhousing with and learning from his two young sons.


Wednesday June 12, 2013 1:40pm - 2:10pm
Fleet Room - Building D Fort Mason, San Francisco, CA

1:40pm

Can't we all just get along? MariaDB and Cassandra
The Cassandra Storage Engine allows access to data in a Cassandra cluster from MariaDB. Learn what the Cassandra Storage Engine is, how to make use of it, and how we implemented it using dynamic columns in MariaDB. We'll also look at CQL, data and command mapping, use cases and benchmarks.

Speakers
CC

Colin Charles

Chief Evangelist, Monty Program Ab
Colin Charles works at Monty Program Ab, on MariaDB. He lives in Kuala Lumpur, Malaysia and had worked at MySQL since 2005. Before joining MySQL, he worked actively on the Fedora and OpenOffice.org projects. He's spoken at many conferences - linux.conf.au, The MySQL Conference & Expo, foss.in, to name a few.


Wednesday June 12, 2013 1:40pm - 2:10pm
Marina Room Conference Center Fort Mason, San Francisco, CA

2:10pm

Guided Discussion
Wednesday June 12, 2013 2:10pm - 2:40pm
Firehouse Building Fort Mason, San Francisco, CA

2:10pm

Real-Time Big Data with Storm, Cassandra, and In-Memory Computing
This session will describe how to resolve processing limitations by placing the streaming and data store interfaces in memory, through an in-memory computing platform. It will also cover how to resolve the complexity challenge by implementing a DevOps approach that abstracts all the underlying infrastructure and provides single-click management of all the application tiers and services, on any environment (private/public cloud, bare metal…). And the best news is that all this optimization can be implemented seamlessly, with no code change to your apps.

Speakers
DF

DeWayne Filppi

Senior Architect, GigaSpaces
Software Engineer/Architect/Consultant with over 25 years broad industry experience.  Current role is assisting GigaSpaces customers with architecting and developing systems in the areas of in-memory and cloud computing.  Particularly focused on elastic real time big data systems.


Wednesday June 12, 2013 2:10pm - 2:40pm
Fleet Room - Building D Fort Mason, San Francisco, CA

2:10pm

Splunk + Cassandra = New value to business
The session will demonstrate Splunk integration with Cassandra today and discuss concepts for further integrations to come in the future.

Speakers
ES

Eddie Satterly

Chief Big Data Evangelist, Splunk
Mr. Satterly has served in a variety of roles including developer, engineer, architect and CTO over his 23 year career for Fortune 500 companies. He has been a long time Splunk fan and user. More recently he was able to revolutionize the way Expedia delivers their core web applications that resulted in improved user experience. At Expedia, he created a highly scalable and flexible Splunk environment and as a result was able to retire 35 other... Read More →


Wednesday June 12, 2013 2:10pm - 2:40pm
Festival Pavilion Fort Mason, San Francisco, CA

2:10pm

Time is Money
This session will focus on our approach to building a scalable TimeSeries database for financial data using Cassandra 1.2 and CQL3. We will discuss how we deal with a heavy mix of reads and writes as well as how we monitor and track performance of the system.

Speakers
JL

Jake Luciani

Apache Cassandra Committer and Quantitative Strategist, BlueMountain Capital Management
Jake Luciani is a committer on Apache Cassandra and Apache Thrift. He works at BlueMountain Capital in NYC building a next generation market data database on Cassandra.
CY

Carl Yeksigian

Quantitative Strategist, BlueMountain Capital Management
Carl graduated from Carnegie Mellon University with a BS in Computer Science. He works as a Quantitative Strategist at BlueMountain Capital in New York City, where he has spent the past year working with Cassandra and comparing other big data solutions to Cassandra.


Wednesday June 12, 2013 2:10pm - 2:40pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

2:10pm

Aligning Technology Infrastructure With Horizontal Business Growth
During this presentation, Stefan Piesche, Chief Technology Officer at Constant Contact, will discuss how he and his team were able to grow and scale Constant Contact’s technology infrastructure by aligning technology with horizontal business growth to improve performance and reduce costs. He will share lessons learned, best practices, and recommendations for other technology executives looking to align their technology infrastructure with business growth.

Speakers
SP

Stefan Piesche

Chief Technology Officer, Constant Contact
As Chief Technology Officer, Piesche focuses on evaluating new technologies to determine how they can help expand Constant Contact's product and service architecture to accommodate future growth and increased performance. To do this, he combines technology vision with practical delivery capabilities and engineering best practices. Prior to Constant Contact, Piesche spent eight years at the Cobalt Group, where he was executive director of...


Wednesday June 12, 2013 2:10pm - 2:40pm
Marina Room Conference Center Fort Mason, San Francisco, CA

2:40pm

Break
Wednesday June 12, 2013 2:40pm - 3:10pm
Fort Mason Fort Mason, San Francisco, CA

3:10pm

Stepping Through the Lifecycle of a Service Offering with Cassandra
Incorporating new technologies into an organization with 15+ years of legacy software means walking a fine line. In this presentation, we'll look at the lifecycle and adoption of Cassandra, from a skunkworks project to a full-fledged service in a legacy organization.

Speakers
CM

Chris "Mac" McEniry

Systems Architect, Sony Network Entertainment
Mac has been a Systems Architect on the operations team at Sony Network Entertainment for the last 3 years. He's spent 13 years building online presences, keeping them running, and helping others get their jobs done. In his spare time, he runs the DefCon network.


Wednesday June 12, 2013 3:10pm - 3:40pm
Fleet Room - Building D Fort Mason, San Francisco, CA

3:10pm

Analytic Fabric, The Future of Analytic Computing
Once you figure out how to capture every bit of data possible, then what? When it comes to business decisions, the first step is to stop thinking about 'data' and 'big data' as two different things. The smartest companies are the ones who leverage all their data to make intelligent, timely decisions. At Jaspersoft, Brian gets to see the best, and worst, business intelligence practices of companies around the world. In this session he will share some industry-wide insights that you can apply to your own unique challenges.

Speakers
BG

Brian Gentile

Chairman and CEO, Jaspersoft
Brian Gentile brings a successful, 27-year technology track record to Jaspersoft, helping it to become the open source business intelligence market leader, measured by commercial size and growth, production deployments of its software, the size and vibrancy of its community, and product downloads. Brian joined Jaspersoft as its first independent Board member in 2005 and then as CEO in 2007.


Wednesday June 12, 2013 3:10pm - 4:10pm
Firehouse Building Fort Mason, San Francisco, CA

3:10pm

In Case of Emergency, Break Glass
The design of Apache Cassandra allows applications to provide constant uptime. Peer-to-peer technology ensures there are no single points of failure, and the consistency guarantees allow applications to function correctly while some nodes are down. There is also a wealth of information provided by the JMX API and the system log. All of this means that when things go wrong, you have the time, information, and platform to resolve them without downtime. This presentation will cover some of the common, and not so common, performance issues, failures, and management tasks observed in running clusters. I'll discuss how to gather information and how to act on it. Operators, developers, and managers will all benefit from this exposition of Cassandra in the wild.

Speakers
AM

Aaron Morton

Apache Cassandra Committer, TheLastPickle
Aaron Morton is a Freelance Developer based in New Zealand, and a Committer on the Apache Cassandra project. In 2010 he gave up the RDBMS world for the scale and reliability of Cassandra. He now spends his time advancing the Cassandra project and helping others get the best out of it.


Wednesday June 12, 2013 3:10pm - 4:10pm
Festival Pavilion Fort Mason, San Francisco, CA

3:10pm

The State of CQL
Since its inception, the Cassandra Query Language (CQL) has grown and matured, resulting in the third version of the language (CQL3) being finalized in Cassandra 1.2. Compared to the legacy Thrift API, CQL3 aims to provide an API that is higher level and more user friendly while still fully embracing the distributed nature of Cassandra and its storage engine. This presentation will introduce CQL3, describing the reasoning and goals behind the language as well as the language itself. CQL's relationship with Thrift will be touched on, along with the CQL binary protocol that was introduced in Cassandra 1.2. The presentation will wrap up by discussing the future of CQL.

Speakers
SL

Sylvain Lebresne

Apache Cassandra Committer and Engineer, DataStax
Sylvain works at DataStax on the Apache Cassandra project, for which he is a core contributor, committer, and PMC member. He has, among other things, played a prominent role in the development of CQL3 and its binary protocol. In a former life, he obtained a Ph.D. in Computer Science from Paris 7 University.


Wednesday June 12, 2013 3:10pm - 4:10pm
Marina Room Conference Center Fort Mason, San Francisco, CA

3:10pm

CMB: An Open Message Bus for the Cloud
The Comcast Silicon Valley Innovation Center has developed a general-purpose message bus for the cloud. The service is API compatible with Amazon's SQS/SNS and is built on Cassandra and Redis with the goal of linear horizontal scalability. This presentation offers an in-depth look at the architecture of the system and how it employs Cassandra as a central component to meet key requirements. The latest feature enhancements and performance data will also be covered.

Speakers
BW

Boris Wolf

Lead Engineer CMB Project, Comcast Silicon Valley Innovation Center
Boris has more than 10 years of experience working for various technology startups in the Bay Area and is currently the engineering lead for the CMB project at the Comcast Silicon Valley Innovation Center.


Wednesday June 12, 2013 3:10pm - 4:10pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

3:40pm

Cassandra at Instagram
Cassandra is a critical part of Instagram's large-scale site infrastructure, which supports more than 100 million active users. This talk is a practical deep dive into data models, systems architecture, and challenges encountered during the implementation process.

Speakers
RB

Rick Branson

Infrastructure Engineer, Instagram
Rick is an Infrastructure Engineer at Instagram where he spends his days building scalable systems to deal with their meteoric growth rates. Rick is a totally impatient dude, so he’s spent the last 10 years obsessed with making complex production systems faster and more reliable. Previously he was a Solutions Architect at DataStax, and has held various engineering roles at Coroutine, Mosaik Solutions, and FedEx.


Wednesday June 12, 2013 3:40pm - 4:10pm
Fleet Room - Building D Fort Mason, San Francisco, CA

4:10pm

Comparing Architectures: Cassandra vs the Field
Have you wondered what actually happens when you submit a write to Cassandra? This vendor-agnostic technical talk will cover the internals of Cassandra's read and write paths and compare them to those of other NoSQL stores, especially HBase, so you can pick the right database for your project. Topics include consistency levels, memtables/memstores, SSTables/HFiles, bloom filters, block indexes, data distribution partitioners, and optimal use cases.

Speakers
SF

Sameer Farooqui

Freelance Big Data Consultant and Trainer, Self
Sameer is Marakana's Big Data specialist, with deep industry expertise in the Hadoop domain. For the past five years, he has deployed various clustering software packages internationally for clients including Fortune 500 companies, governments, hospitals, and banks.


Wednesday June 12, 2013 4:10pm - 5:00pm
Golden Gate Room Conference Center Fort Mason, San Francisco, CA

4:10pm

Java and .NET Client Drivers - Cassandra development on fire
Cassandra 1.2 finalizes CQL3 and introduces a new binary protocol for client/server communication. These two components are the foundation of the new line of drivers developed by DataStax. Based on years of experience with Cassandra, these new drivers for Java, .NET, and Python come with an asynchronous and lightweight architecture, a clean and simple API, and a standardized way to discover nodes and to manage load balancing and failover. This presentation will give an in-depth look at these new drivers, which will make your Cassandra-based applications even more robust, efficient, and simple to write.

Speakers
MF

Michael Figuiere

Developer Advocate, DataStax
Michael is an engineer and a developer advocate at DataStax, where he actively works to improve Cassandra. At ease with both Enterprise Java and lower-level technologies, he specializes in distributed architectures and topics such as NoSQL, search engines, and data processing. He often speaks about NoSQL at conferences and user groups and loves to write about his favorite topics.
PM

Patrick McFadin

Principal Solutions Architect, DataStax
Patrick McFadin is a Principal Solutions Architect at DataStax, the commercial company behind Apache Cassandra. Prior to his work at DataStax, he was the Chief Architect at Hobsons, an education services company. He obtained a BS in Computer Engineering from Cal Poly, San Luis Obispo, and holds the distinction of being the only recipient of a medal (as far as anyone can find out) for hacking while serving in the US Navy.


Wednesday June 12, 2013 4:10pm - 5:00pm
Marina Room Conference Center Fort Mason, San Francisco, CA

4:10pm

How Not to Use Cassandra
At Spotify, we see failure as an opportunity to learn. During the two years we've used Cassandra in our production environment, we have learned a lot. This session touches on some of the exciting design anti-patterns, performance killers and other opportunities to lose a finger that are at your disposal with Cassandra.

Speakers
AL

Axel Liljencrantz

Backend Developer, Spotify
Axel Liljencrantz is a backend infrastructure engineer at Spotify, where he works on improving the scalability and stability of the Spotify backend. Axel enjoys holding long-winded monologues on almost any subject except himself.


Wednesday June 12, 2013 4:10pm - 5:00pm
Fleet Room - Building D Fort Mason, San Francisco, CA

4:10pm

Cassandra at eBay Scale
We have seen rapid adoption of C* at eBay in the past two years. We have made tremendous efforts to integrate C* into existing database platforms, including Oracle, MySQL, Postgres, MongoDB, XMP, etc. We have also scaled C* to meet business requirements and encountered technical challenges you only see at eBay scale: 100TB of data on hundreds of nodes. We will share our experience with deployment automation, management, monitoring, and reporting for both Apache Cassandra and DataStax Enterprise.

Speakers
AJ

Anurag Jambhekar

Senior Manager of Database Infrastructure, eBay
Anurag Jambhekar is Sr. Manager, Database Infrastructure at eBay, where his team is responsible for 1000+ heterogeneous databases supporting the high volume and concurrency of ebay.com. He is obsessive about getting the best performance from applications, databases, and systems. He likes solving problems around scalability and performance and is very excited about the innovation happening in the data and database arena in the last decade. Currently, he...
FQ

Feng Qu

Principal DBA, eBay
Having started with Oracle 5, Feng has worked with RDBMSs for the past 20 years at various high-tech companies, including DoubleClick, Yahoo, Intuit, and eBay. In recent years, he has taken a liking to NoSQL technology and quickly became a huge Cassandra fan while working at eBay. He is now responsible for companywide Cassandra projects, from design to deployment.


Wednesday June 12, 2013 4:10pm - 5:00pm
Festival Pavilion Fort Mason, San Francisco, CA