Triangle Hadoop Users Group
6 days ago

Slides from May 22nd talk by David Arthur

Thanks to David Arthur of Lucid Imagination for his presentation on Apache ZooKeeper!

1 week ago

Intro to Apache Zookeeper

ZooKeeper - An Introduction and Practical Use Cases”

Speaker: David Arthur, Lucid Imagination

Date: May 22nd, 6:30PM

Location: Bronto Software, Durham, NC

RSVP: http://trihug-05-2012.eventbrite.com/

Abstract: ZooKeeper is a distributed coordination service with a strong emphasis on update consistency. It can be used for simple things like configuration management and distributed id assignment, as well as more complex things like distributed locking and service discovery. The API provided by ZooKeeper is rather low-level and requires a bit of boilerplate code, so we will also look at some higher level frameworks.

David Arthur is a software engineer at Lucid Imagination working on the new “Big Data” team. He has been working for the last two years building distributed systems where Hadoop has been a central component. Prior to working in the “big data” space, he focused mainly on data side of applications: schema optimization, data warehousing, etc. He attended Florida State University where he received a B.S. in Physics and completed two years of graduate studies in Scientific Computing.

1 month ago

Next Meeting: IBM Watson on April 12, 2012 @ Bronto Software

Title: IBM Watson: Big Data Text Analytics

LocationBronto Software in Durham, NC

Speaker: John Gerken

RSVPhttp://trihug-04-2012.eventbrite.com/

Abstract: What is IBM Watson? It managed to defeat two previously undefeated human opponents in a game show and it has shown how super computers are becoming more and more able to understand and answer questions in ways previously reserved for the domain of human thought.  But what most don’t realize is how the synergy between Big Data and text analytics was integral to enabling IBM Watson’s capabilities. Curious as to how Watson leveraged Big Data and text analytics technologies? Wondering what the future may hold for applying them?  If so, attend TriHUG on April 12 to find out.  

Speaker Bio: John Gerken is a Senior Software Architect in IBM’s Emerging Technologies jStart Team, where he is responsible for recognizing, promoting and developing prototypes of software technologies and trends that could positively impact IBM’s customers. John is an IBM Watson Team Leader working to enable Watson to be used by customers.  He is also a recognized thought leader in the area of Situational Applications and mashup ecosystems and is a principle evangelist for these technologies. John is a member of the North Carolina Technical Experts Council (NC TEC), which is an IBM Academy affiliated technical advisory and vitality organization serving the RTP, NC area.  He also holds a Bachelors of Science in Jazz Performance and plays at every opportunity. 


SponsorsBronto SoftwareLucid Imagination

2 months ago

Slides from March 20 talk by Jameson Lopp

Thanks to Jameson Lopp of Bronto Software for his presentation on Pratical Pig.  Here are the slides:

3 months ago

Slides from Feb 16. talk by Adam Gugliciello of Datameer

Thanks to Adam Gugliciello of Datameer for the talk at TriHUG last week!  Here are the slides:

3 months ago

Next Meeting: Feb. 16 @ Bronto Software

Title: Financial Data Analytics with Hadoop

Sponsored By: 

Datameer

RSVP here

Abstract: 

Hadoop based applications are becoming critical in the financial services arena for the analysis and correlation of large volumes of structured and unstructured data.  In addition, the Dodd-Frank Act signifies the largest US financial regulatory change in several decades and requires much greater transparency on financial data.  In this session, we will answer common questions and demonstrate use cases in how Hadoop and Datameer help with asset management and risk management, fraud detection and data security.   

Leave this session knowing about:

  • Financial data and Hadoop. What data lends itself to Hadoop? What doesn’t?
  • Benchmarks from real-world uses of Hadoop in finance
  • How to effectively migrate, manage, and analyze financial data using Hadoop

Bio: Adam Gugliciello, a 15-year veteran in Software Engineering and Systems Architecture specializes in highly available, parallel systems. Most recently he has been developing grid computing solutions to enable deep analyses and intelligence gathering on huge software systems for technical debt and functional mapping. Adam is a Solution Engineer at Datameer and helps bring Financial and Telco applications expertise to the utilization of the Datameer business intelligence suite.

4 months ago

Slides from Intro to HBase presentation January 2012

Thanks to Chris Shain from Tresata for coming to Durham last night to talk about HBase.



TriHUG January 2012 Talk by Chris Shain
4 months ago

Next Meeting: January 17, 2012 @ Bronto Software

Title: Intro to Apache HBase by Chris Shain of Tresata

Location: Bronto Software in Durham, NC

RSVP

Abstract: Chris will provide an introduction to Apache HBase, aiming to discuss:

  1. What is HBase? (High level overview)
  2. Details of the HBase architecture
  3. How do clients interact with HBase?
  4. Some general HBase patterns and anti-patterns
  5. What are the use cases for HBase vs. Relational DB?

Bio: Chris Shain is the software development lead at Tresata, a provider of Big Data solutions for the financial industry in Charlotte NC. His background includes 7+ years of software development experience in the financial services industry, with a focus on customer-facing data management applications and data warehousing. Lately he works with Hadoop and HBase on data volumes in the multi-terabyte range, and tinkers with geographic information systems. He lives in Charlotte NC, and can be reached at chris@tresata.com or twitter @ChrisShain.

6 months ago

Slides from Alan Gates Presentation on Nov. 15, 2011

Thanks to Alan Gates of Hortonworks for the two excellent presentations on Apache Pig and Apache HCatalog. Links to the slides for the two talks are included below and are also available on Slideshare.

7 months ago

Slides from Oct. 11 TriHUG meeting featuring Josh Patterson of Cloudera