Big Data and Hadoop in particular is currently big business, but one of the most significant problems is getting your data into Hadoop from your existing RDBMS stores. Whether you have user profile information, trades, or transactional information from your webstore in your RDBMS, analysing it often takes place within Hadoop. Dump and load techniques imply delays and intermittent replication. With Tungsten Replicator we can move data from multiple MySQL and Oracle databases directly into Hadoop giving you carbon-copies of your RDBMS data in real-time. In this presentation I'll demonstrate how the replicator achieves this including a live demo, and describe how it has helped Groupon and Booking with the data migration needs, and where we are headed in the future for supporting different tools and techniques for replication.
Survey this Session