<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><atom:link rel="hub" href="http://tumblr.superfeedr.com/" xmlns:atom="http://www.w3.org/2005/Atom"/><description>Apache Hadoop User Group in the Raleigh/Durham/Chapel Hill area of North Carolina, USA</description><title>Triangle Hadoop Users Group</title><generator>Tumblr (3.0; @trihug)</generator><link>http://www.trihug.org/</link><item><title>Slides from May 22nd talk by David Arthur</title><description>&lt;p&gt;&lt;div class="post_title"&gt;Thanks to David Arthur of &lt;a href="http://www.lucidimagination.com/"&gt;Lucid Imagination&lt;/a&gt; for his presentation on Apache ZooKeeper!&lt;/div&gt;
&lt;div class="post_title"&gt;&lt;/div&gt;
&lt;div id="__ss_13060152"&gt;&lt;object height="355" id="__sse13060152" width="425"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=trihug-may222012-120524083135-phpapp01&amp;amp;rel=0&amp;amp;stripped_title=introduction-to-zookeeper-trihug-may-22-2012&amp;amp;userName=mumrah"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;&lt;/div&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6=" type="text/javascript"&gt;&lt;/script&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/23676433044</link><guid>http://www.trihug.org/post/23676433044</guid><pubDate>Thu, 24 May 2012 12:41:00 -0400</pubDate></item><item><title>Intro to Apache Zookeeper</title><description>&lt;p&gt;ZooKeeper - An Introduction and Practical Use Cases&amp;#8221;&lt;/p&gt;
&lt;p&gt;Speaker: David Arthur, Lucid Imagination&lt;/p&gt;
&lt;p&gt;Date: May 22nd, 6:30PM&lt;/p&gt;
&lt;p&gt;Location: Bronto Software, Durham, NC&lt;/p&gt;
&lt;p&gt;RSVP: &lt;a href="http://trihug-05-2012.eventbrite.com/"&gt;&lt;a href="http://trihug-05-2012.eventbrite.com/"&gt;http://trihug-05-2012.eventbrite.com/&lt;/a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Abstract: ZooKeeper is a distributed coordination service with a strong emphasis on update consistency. It can be used for simple things like configuration management and distributed id assignment, as well as more complex things like distributed locking and service discovery. The API provided by ZooKeeper is rather low-level and requires a bit of boilerplate code, so we will also look at some higher level frameworks.&lt;/p&gt;
&lt;p&gt;David Arthur is a software engineer at Lucid Imagination working on the new &amp;#8220;Big Data&amp;#8221; team. He has been working for the last two years building distributed systems where Hadoop has been a central component. Prior to working in the &amp;#8220;big data&amp;#8221; space, he focused mainly on data side of applications: schema optimization, data warehousing, etc. He attended Florida State University where he received a B.S. in Physics and completed two years of graduate studies in Scientific Computing.&lt;/p&gt;</description><link>http://www.trihug.org/post/23477090313</link><guid>http://www.trihug.org/post/23477090313</guid><pubDate>Mon, 21 May 2012 08:41:00 -0400</pubDate></item><item><title>Next Meeting: IBM Watson on April 12, 2012 @ Bronto Software</title><description>&lt;p&gt;&lt;strong&gt;Title&lt;/strong&gt;: IBM Watson: Big Data Text Analytics&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Location&lt;/strong&gt;: &lt;a href="http://www.bronto.com/"&gt;Bronto Software&lt;/a&gt; in Durham, NC&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Speaker&lt;/strong&gt;: John Gerken&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;RSVP&lt;/strong&gt;: &lt;a href="http://trihug-04-2012.eventbrite.com/"&gt;&lt;a href="http://trihug-04-2012.eventbrite.com/"&gt;http://trihug-04-2012.eventbrite.com/&lt;/a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Abstract&lt;/strong&gt;: What is IBM Watson? It managed to defeat two previously undefeated human opponents in a game show and it has shown how super computers are becoming more and more able to understand and answer questions in ways previously reserved for the domain of human thought.  But what most don&amp;#8217;t realize is how the synergy between Big Data and text analytics was integral to enabling IBM Watson&amp;#8217;s capabilities. Curious as to how Watson leveraged Big Data and text analytics technologies? Wondering what the future may hold for applying them?  If so, attend &lt;a href="http://www.trihug.org/"&gt;TriHUG&lt;/a&gt; on April 12 to find out.&lt;span class="s1"&gt; &lt;/span&gt; &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Speaker Bio&lt;/strong&gt;: John Gerken is a Senior Software Architect in IBM&amp;#8217;s Emerging Technologies jStart Team, where he is responsible for recognizing, promoting and developing prototypes of software technologies and trends that could positively impact IBM&amp;#8217;s customers. John is an IBM Watson Team Leader working to enable Watson to be used by customers.  He is also a recognized thought leader in the area of Situational Applications and mashup ecosystems and is a principle evangelist for these technologies. John is a member of the North Carolina Technical Experts Council (NC TEC), which is an &lt;a href="http://www-03.ibm.com/ibm/academy/index.html"&gt;IBM Academy&lt;/a&gt; affiliated technical advisory and vitality organization serving the RTP, NC area.  He also holds a Bachelors of Science in Jazz Performance and plays at every opportunity.&lt;span class="s2"&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span class="s2"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span class="s2"&gt;&lt;strong&gt;Sponsors&lt;/strong&gt;: &lt;a href="http://www.bronto.com/"&gt;Bronto Software&lt;/a&gt;, &lt;a href="http://www.lucidimagination.com/"&gt;Lucid Imagination&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/20479441949</link><guid>http://www.trihug.org/post/20479441949</guid><pubDate>Wed, 04 Apr 2012 15:38:24 -0400</pubDate></item><item><title>Slides from March 20 talk by Jameson Lopp</title><description>&lt;p&gt;Thanks to Jameson Lopp of &lt;a href="http://www.bronto.com"&gt;Bronto Software&lt;/a&gt; for his presentation on Pratical Pig.  Here are the slides:&lt;/p&gt;
&lt;div id="__ss_11710651"&gt;&lt;strong&gt;&lt;a href="http://www.slideshare.net/trihug/practical-pig" title="Pratical Pig"&gt;Pratical Pig&lt;/a&gt;&lt;/strong&gt; &lt;object height="355" id="__sse11710651" width="425"&gt;&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=practicalpig-120323103120-phpapp02&amp;amp;stripped_title=practical-pig&amp;amp;userName=trihug"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;
&lt;div&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/trihug"&gt;trihug&lt;/a&gt;.&lt;/div&gt;
&lt;/div&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6=" type="text/javascript"&gt;&lt;/script&gt;</description><link>http://www.trihug.org/post/19784633284</link><guid>http://www.trihug.org/post/19784633284</guid><pubDate>Fri, 23 Mar 2012 11:50:23 -0400</pubDate></item><item><title>Slides from Feb 16. talk by Adam Gugliciello of Datameer</title><description>&lt;p&gt;Thanks to Adam Gugliciello of &lt;a href="http://www.datameer.com"&gt;Datameer&lt;/a&gt; for the talk at TriHUG last week!  Here are the slides:&lt;/p&gt;

&lt;div id="__ss_11710651"&gt;&lt;strong&gt;&lt;a href="http://www.slideshare.net/trihug/financial-services-trihug" title="Financial services trihug"&gt;Financial services trihug&lt;/a&gt;&lt;/strong&gt;
&lt;object height="355" id="__sse11710651" width="425"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=financialservicestrihug-120222172653-phpapp01&amp;amp;stripped_title=financial-services-trihug&amp;amp;userName=trihug"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;
&lt;div&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/trihug"&gt;trihug&lt;/a&gt;.&lt;/div&gt;
&lt;/div&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;</description><link>http://www.trihug.org/post/18100994520</link><guid>http://www.trihug.org/post/18100994520</guid><pubDate>Wed, 22 Feb 2012 19:55:28 -0500</pubDate><category>hadoop</category><category>finance</category><category>big data</category><category>datameer</category></item><item><title>Next Meeting: Feb. 16 @ Bronto Software</title><description>&lt;p&gt;&lt;strong&gt;Title:&lt;/strong&gt; &lt;strong&gt;Financial Data Analytics with Hadoop&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sponsored By: &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.datameer.com" title="Datameer" target="_blank"&gt;&lt;img alt="Datameer" src="http://datameer.com/fileadmin/templates/datameer/images/logo.png"/&gt;&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://trihug-feb2012.eventbrite.com/"&gt;RSVP here&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Abstract:&lt;/strong&gt; &lt;/p&gt;
&lt;p class="p1"&gt;Hadoop based applications are becoming critical in the financial services arena for the analysis and correlation of large volumes of structured and unstructured data.  In addition, the Dodd-Frank Act signifies the largest US financial regulatory change in several decades and requires much greater transparency on financial data.  In this session, we will answer common questions and demonstrate use cases in how Hadoop and Datameer help with asset management and risk management, fraud detection and data security.   &lt;/p&gt;
&lt;p class="p1"&gt;Leave this session knowing about:&lt;/p&gt;
&lt;ul class="ul1"&gt;&lt;li class="li1"&gt;Financial data and Hadoop. What data lends itself to Hadoop? What doesn&amp;#8217;t?&lt;/li&gt;
&lt;li class="li1"&gt;Benchmarks from real-world uses of Hadoop in finance&lt;/li&gt;
&lt;li class="li1"&gt;How to effectively migrate, manage, and analyze financial data using Hadoop&lt;/li&gt;
&lt;/ul&gt;&lt;p class="p3"&gt;&lt;strong&gt;Bio: &lt;/strong&gt;Adam Gugliciello, a 15-year veteran in Software Engineering and Systems Architecture specializes in highly available, parallel systems. Most recently he has been developing grid computing solutions to enable deep analyses and intelligence gathering on huge software systems for technical debt and functional mapping. Adam is a Solution Engineer at Datameer and helps bring Financial and Telco applications expertise to the utilization of the Datameer business intelligence suite.&lt;/p&gt;</description><link>http://www.trihug.org/post/16862661457</link><guid>http://www.trihug.org/post/16862661457</guid><pubDate>Wed, 01 Feb 2012 08:37:00 -0500</pubDate></item><item><title>Slides from Intro to HBase presentation January 2012</title><description>&lt;div id="__ss_11140474"&gt;Thanks to &lt;a href="http://twitter.com/chrisshain"&gt;Chris Shain&lt;/a&gt; from &lt;a href="http://www.tresata.com/"&gt;Tresata&lt;/a&gt; for coming to Durham last night to talk about HBase.&lt;/div&gt;
&lt;p&gt;&lt;br/&gt;&lt;br/&gt;&lt;/p&gt;
&lt;div&gt;&lt;strong&gt;&lt;a href="http://www.slideshare.net/trihug/intro-to-apache-hbase-by-chris-shain-of-tresata" title="TriHUG January 2012 Talk by Chris Shain" target="_blank"&gt;TriHUG January 2012 Talk by Chris Shain&lt;/a&gt;&lt;/strong&gt; 
&lt;object height="355" id="__sse11140474" width="425"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=hbase-120118124521-phpapp01&amp;amp;stripped_title=intro-to-apache-hbase-by-chris-shain-of-tresata&amp;amp;userName=trihug"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;
&lt;/div&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;</description><link>http://www.trihug.org/post/16070488775</link><guid>http://www.trihug.org/post/16070488775</guid><pubDate>Wed, 18 Jan 2012 14:04:00 -0500</pubDate></item><item><title>Next Meeting: January 17, 2012 @ Bronto Software</title><description>&lt;p&gt;Title: Intro to Apache HBase by Chris Shain of Tresata&lt;/p&gt;
&lt;p&gt;Location: Bronto Software in Durham, NC&lt;/p&gt;
&lt;p&gt;&lt;a href="http://trihug-01-2012.eventbrite.com"&gt;RSVP&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Abstract: Chris will provide an introduction to &lt;a href="http://hbase.apache.org"&gt;Apache HBase&lt;/a&gt;, aiming to discuss:&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;What is HBase? (High level overview)&lt;/li&gt;
&lt;li&gt;Details of the HBase architecture&lt;/li&gt;
&lt;li&gt;How do clients interact with HBase?&lt;/li&gt;
&lt;li&gt;Some general HBase patterns and anti-patterns&lt;/li&gt;
&lt;li&gt;What are the use cases for HBase vs. Relational DB?&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;Bio: Chris Shain is the software development lead at Tresata, a provider of Big Data solutions for the financial industry in Charlotte NC. His background includes 7+ years of software development experience in the financial services industry, with a focus on customer-facing data management applications and data warehousing. Lately he works with Hadoop and HBase on data volumes in the multi-terabyte range, and tinkers with geographic information systems. He lives in Charlotte NC, and can be reached at &lt;a href="mailto:chris@tresata.com"&gt;&lt;span class="s1"&gt;chris@tresata.com&lt;/span&gt;&lt;/a&gt; or twitter @ChrisShain.&lt;/p&gt;</description><link>http://www.trihug.org/post/15529857053</link><guid>http://www.trihug.org/post/15529857053</guid><pubDate>Sun, 08 Jan 2012 16:54:00 -0500</pubDate></item><item><title>Slides from Alan Gates Presentation on Nov. 15, 2011</title><description>&lt;p&gt;Thanks to Alan Gates of &lt;a href="http://www.hortonworks.com"&gt;Hortonworks&lt;/a&gt; for the two excellent presentations on Apache Pig and Apache HCatalog. Links to the slides for the two talks are included below and are also available on Slideshare.&lt;/p&gt;
&lt;div id="__ss_10223469"&gt;&lt;strong&gt;&lt;a href="http://www.slideshare.net/trihug/trihug-november-pig-talk-by-alan-gates" title="TriHUG November Pig Talk by Alan Gates"&gt;TriHUG November Pig Talk by Alan Gates&lt;/a&gt;&lt;/strong&gt; 
&lt;object height="355" id="__sse10223469" width="425"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=trihugpignov2011-111118141510-phpapp01&amp;amp;stripped_title=trihug-november-pig-talk-by-alan-gates&amp;amp;userName=trihug"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;
&lt;div&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/trihug"&gt;trihug&lt;/a&gt;.&lt;/div&gt;
&lt;div&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;&lt;/div&gt;
&lt;div&gt;&lt;/div&gt;
&lt;div&gt;&lt;/div&gt;
&lt;div id="__ss_10223502"&gt;&lt;strong&gt;&lt;a href="http://www.slideshare.net/trihug/trihug-november-hcatalog-talk-by-alan-gates" title="TriHUG November HCatalog Talk by Alan Gates"&gt;TriHUG November HCatalog Talk by Alan Gates&lt;/a&gt;&lt;/strong&gt;
&lt;object height="355" id="__sse10223502" width="425"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=trihughcatnov2011-111118141802-phpapp01&amp;amp;stripped_title=trihug-november-hcatalog-talk-by-alan-gates&amp;amp;userName=trihug"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/object&gt;
&lt;div&gt;View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/trihug"&gt;trihug&lt;/a&gt;.&lt;/div&gt;
&lt;/div&gt;</description><link>http://www.trihug.org/post/12977313480</link><guid>http://www.trihug.org/post/12977313480</guid><pubDate>Fri, 18 Nov 2011 15:20:38 -0500</pubDate></item><item><title>Slides from Oct. 11 TriHUG meeting featuring Josh Patterson of Cloudera</title><description>&lt;p&gt;&lt;strong&gt;&lt;a title="OSCON Data 2011 - Lumberyard" target="_blank" href="http://www.slideshare.net/jpatanooga/oscon-data-2011-lumberyard"&gt;OSCON Data 2011 - Lumberyard&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/9476155" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"&gt;&lt;/iframe&gt; View more &lt;a target="_blank" href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a target="_blank" href="http://www.slideshare.net/jpatanooga"&gt;Josh Patterson&lt;/a&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/11363542728</link><guid>http://www.trihug.org/post/11363542728</guid><pubDate>Wed, 12 Oct 2011 15:53:37 -0400</pubDate></item><item><title>Next Meeting: November 15, 2011 @ Bronto Software</title><description>&lt;p&gt;Our next meeting will be November 15 at Bronto Software.  The speaker will be Alan Gates, the author of Programming Pig and a member of the &lt;a href="http://www.hortonworks.com"&gt;Hortonworks&lt;/a&gt; team.  RSVP &lt;a href="http://trihug-nov.eventbrite.com"&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;-&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Title:  New Features in Pig 0.9 and  Introducing HCatalog&lt;/p&gt;
&lt;p class="p1"&gt;Abstract:  Pig 0.9 added several features to make Pig a more powerful data processing platform, including macros, include statements, and the ability to embed Pig in Python for control flow.  We&amp;#8217;ll cover these, talk about some new features that have been added since 0.9, and what&amp;#8217;s next on Pig&amp;#8217;s roadmap.&lt;/p&gt;
&lt;p class="p1"&gt;HCatalog is a table management and storage management layer for Hadoop that enables users with different data processing tools – Pig, MapReduce, Hive, Streaming – to more easily read and write data on the grid. HCatalog’s table abstraction presents users with a relational view of data in the Hadoop distributed file system (HDFS) and ensures that users need not worry about where or in what format their data is stored – RCFile format, text files, sequence files.  This talk will include an overview of HCatalog&amp;#8217;s features and a discussion of its current roadmap.&lt;/p&gt;
&lt;p class="p1"&gt;Bio:  Alan is a co-founder of Hortonworks as well as an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a forthcoming book from O’Reilly Press. Follow Alan on Twitter: @alanfgates.&lt;/p&gt;</description><link>http://www.trihug.org/post/11351679901</link><guid>http://www.trihug.org/post/11351679901</guid><pubDate>Wed, 12 Oct 2011 08:18:00 -0400</pubDate></item><item><title>TriHUG Next Meeting featuring Josh Patterson of Cloudera set for Oct. 11</title><description>&lt;p&gt;The next Triangle Hadoop User Group meeting will be October 11th at Bronto Software and will be featuring Josh Patterson of Cloudera.  &lt;a href="http://trihug-oct-11.eventbrite.com"&gt;RSVP here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;span&gt;
&lt;p&gt;Title: Lumberyard: Time series Indexing at Scale&lt;/p&gt;
&lt;p&gt;Abstract: &lt;/p&gt;
&lt;p class="p1"&gt;As time series data explodes in volume in the genomic, sensor, and&lt;/p&gt;
&lt;p class="p1"&gt;financial realms [1] companies are looking for more effective ways to&lt;/p&gt;
&lt;p class="p1"&gt;store and query this data. To handle this explosion in scale systems&lt;/p&gt;
&lt;p class="p1"&gt;are looking to the Hadoop, HBase, and NoSQL domain for components to&lt;/p&gt;
&lt;p class="p1"&gt;build their systems on. In this talk we introduce Lumberyard [3], a&lt;/p&gt;
&lt;p class="p1"&gt;system which can potentially (1) store Terabytes of time series data&lt;/p&gt;
&lt;p class="p1"&gt;and allow for this data to be interactively queried at low latencies&lt;/p&gt;
&lt;p class="p1"&gt;to provide real time access. Lumberyard stores iSAX [4] indexes in&lt;/p&gt;
&lt;p class="p1"&gt;HBase&amp;#8217;s Multi-dimensional sorted map storage system which give&lt;/p&gt;
&lt;p class="p1"&gt;Lumberyard the reliability of HDFS yet the low latencies of HBase. Our&lt;/p&gt;
&lt;p class="p1"&gt;approach leverages a multidimensional indexing structure which is&lt;/p&gt;
&lt;p class="p1"&gt;stored in HBase&amp;#8217;s highly available distributed multi-dimensional&lt;/p&gt;
&lt;p class="p1"&gt;sorted map. We present the design of Lumberyard&amp;#8217;s implementation and&lt;/p&gt;
&lt;p class="p1"&gt;illustrate the differences between an in-memory iSAX index compared&lt;/p&gt;
&lt;p class="p1"&gt;with a persisted HBase-backed iSAX index.&lt;/p&gt;

&lt;p class="p1"&gt;Sponsored by &lt;a href="http://cloudera.com/"&gt;Cloudera&lt;/a&gt; and &lt;a href="http://www.bronto.com/"&gt;Bronto Software&lt;/a&gt;.&lt;/p&gt;

&lt;p class="p1"&gt;More info at &lt;a href="http://www.trihug.org/"&gt;&lt;a href="http://www.trihug.org"&gt;www.trihug.org&lt;/a&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p class="p1"&gt;Bio:&lt;/p&gt;

&lt;p class="p1"&gt;Master’s Thesis: self-organizing mesh networks Published in IAAI-09:&lt;/p&gt;
&lt;p class="p1"&gt;TinyTermite: A Secure Routing Algorithm&lt;/p&gt;

&lt;p class="p1"&gt;Conceived, built, and led Hadoop integration for the openPDC project&lt;/p&gt;
&lt;p class="p1"&gt;at TVA (Smartgrid stuff). Led small team which designed classification&lt;/p&gt;
&lt;p class="p1"&gt;techniques for timeseries and Map Reduce. Open source work at&lt;/p&gt;
&lt;p class="p3"&gt;&lt;span class="s1"&gt;&lt;a href="http://openpdc.codeplex.com/"&gt;&lt;a href="http://openpdc.codeplex.com"&gt;http://openpdc.codeplex.com&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="p1"&gt;Now: Sr. Solutions Architect at Cloudera&lt;/p&gt;
&lt;/span&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/10200106608</link><guid>http://www.trihug.org/post/10200106608</guid><pubDate>Wed, 14 Sep 2011 08:26:45 -0400</pubDate></item><item><title>Slides from Ted Dunning's Sept. 2011 talk</title><description>&lt;p&gt;Thanks to everyone for attending last night&amp;#8217;s talk!  Ted&amp;#8217;s slides are available for download below.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a title="MapR, Implications for Integration" target="_blank" href="http://www.slideshare.net/trihug/mapr-implications-for-integration"&gt;MapR, Implications for Integration&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/9253909" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"&gt;&lt;/iframe&gt; View more &lt;a target="_blank" href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a target="_blank" href="http://www.slideshare.net/trihug"&gt;trihug&lt;/a&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/10199032705</link><guid>http://www.trihug.org/post/10199032705</guid><pubDate>Wed, 14 Sep 2011 07:15:48 -0400</pubDate></item><item><title>Starfish Talk Slides from April 2011</title><description>&lt;p&gt;Under the better late than never category, here are the slides from the April 2011 TriHUG meeting on Starfish.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a title="Starfish: A Self-tuning System for Big Data Analytics" href="http://www.slideshare.net/gsingers/starfish-a-selftuning-system-for-big-data-analytics"&gt;Starfish: A Self-tuning System for Big Data Analytics&lt;/a&gt;&lt;/strong&gt;
&lt;object id="__sse9191801" width="425" height="355"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=trihugtalkpublic-110909074008-phpapp01&amp;amp;stripped_title=starfish-a-selftuning-system-for-big-data-analytics&amp;amp;userName=gsingers"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse9191801" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=trihugtalkpublic-110909074008-phpapp01&amp;amp;stripped_title=starfish-a-selftuning-system-for-big-data-analytics&amp;amp;userName=gsingers" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;
View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/gsingers"&gt;gsingers&lt;/a&gt;.&lt;/p&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;</description><link>http://www.trihug.org/post/9992942191</link><guid>http://www.trihug.org/post/9992942191</guid><pubDate>Fri, 09 Sep 2011 08:42:13 -0400</pubDate></item><item><title>Next Meeting: Sept. 13 @ Bronto Software</title><description>&lt;p&gt;We trust everyone has had a good summer and is equally excited to get back into learning more about Apache Hadoop and scaling.  Our next meeting will be Sept. 13 at Bronto Software.  Food and drinks start at 6:30 and the talks start at 7.&lt;/p&gt;
&lt;p class="p1"&gt;We are pleased to announce that our speaker will be Ted Dunning from &lt;a target="_blank" href="http://mapr.com"&gt;MapR Technologies&lt;/a&gt;.   See below for more details.  Please &lt;a target="_blank" href="http://trihug-9-2011.eventbrite.com/"&gt;RSVP here&lt;/a&gt;.&lt;/p&gt;
&lt;p class="p2"&gt;&lt;strong&gt;Title&lt;/strong&gt;: MapR, Architecture and Implications&lt;/p&gt;
&lt;p class="p1"&gt;&lt;strong&gt;Abstract&lt;/strong&gt;:&lt;/p&gt;
&lt;p class="p1"&gt;The talk will be a description of how MapR&amp;#8217;s architectural advances allow significant improvements in speed, reliability and scalability over stock Hadoop.  This will include a dive into the MapR file system and a discussion of how the map-reduce layer has been changed and the impact on other Hadoop eco-system components.  This will include actual test results.&lt;/p&gt;
&lt;p class="p1"&gt;In the second section of my talk, I will describe how this new architecture has surprising consequences.  In particular, I will show how tasks like machine learning, data visualization and search indexing can all work better on the MapR platform.&lt;/p&gt;
&lt;p class="p1"&gt;&lt;strong&gt;Ted&amp;#8217;s Bio&lt;/strong&gt;:&lt;/p&gt;
&lt;p class="p1"&gt;Ted has held Chief Scientist positions at Veoh Networks, ID Analytics and at MusicMatch, (now Yahoo Music). Ted is responsible for building the most advanced identity theft detection system on the planet, as well as one of the largest peer-assisted video distribution systems and ground-breaking music and video recommendations systems. Ted has 15 issued and 15 pending patents and contributes to several Apache open source projects including Hadoop, Zookeeper and Hbase. He is also a committer for Apache Mahout. Ted earned a BS degree in electrical engineering from the University of Colorado; a MS degree in computer science from New Mexico State University; and a Ph.D. in computing science from Sheffield University in the United Kingdom. Ted also bought the drinks at one of the very first Hadoop User Group meetings.&lt;/p&gt;</description><link>http://www.trihug.org/post/9512860582</link><guid>http://www.trihug.org/post/9512860582</guid><pubDate>Sun, 28 Aug 2011 15:46:23 -0400</pubDate></item><item><title>RTP Scaling Hackathon (Planning Stages)</title><description>&lt;p&gt;Some TriHUG members are in the early stage of putting together an all day  hackathon on all things scaling (Hadoop, Cassandra, Hive, Pig, Mahout,  etc.) and wanted to get some info out to the community as well as a call for  volunteers and sponsors.&lt;/p&gt;
&lt;p&gt;The basic gist of the day is that  we get together and spend the day hacking and learning about writing  scalable, fault tolerant systems.  All ranges of experience are welcome  and we fully expect that one of the groups that forms will be a  &amp;#8220;tutorial&amp;#8221; group, while other groups will be doing more advanced things.   The key is to get lots of interaction and cross-fertilization of  ideas.&lt;/p&gt;
&lt;p&gt;Our tentative plan is that we will make available:&lt;/p&gt;
&lt;p&gt;1.  Compute Cluster time (likely Amazon EC2) along with ready to use  instances w/ appropriate things already installed.  (More later) &lt;br/&gt; 2. Some public data sets, but feel free to bring your own publicly available on Amazon S3&amp;#160;&lt;br/&gt; 3. Food, drinks, etc. including pizza/beer at the end &lt;br/&gt; 4. Network connectivity &lt;br/&gt; 5. Space to work in &lt;br/&gt; 6. (TBD) Machine to submit jobs using a fair scheduler&lt;/p&gt;
&lt;p&gt;You  need to bring your laptop and an open mind.  Also having your favorite  tools on your machine would be good.  A github account or something  similar would also be useful.&lt;/p&gt;
&lt;p&gt;Our likely date for this is  June 18th with a backup date of June 25 (pending space availability) from 9  AM - 6 (?) PM.    Attendance will require RSVP and we will send out  sign up info later.  For now, we are targeting it to be free (including  EC2 compute time), but that is predicated on us getting sponsorships to  cover costs, so if you think you or your company can sponsor, please let  us know ASAP.&lt;/p&gt;
&lt;p&gt;Tentative Schedule (strawman):   &lt;br/&gt; 8:30:  Doors open/networking/coffee/snacks &lt;br/&gt; 9 - 9:30: Idea pitches and Seed Projects announced and teams formed &amp;#8212;   people can stand up and say what they are interested in and then we  imagine people can team up based on their interest &amp;#8212; for instance, I  will probably work on Mahout and machine learning &lt;br/&gt; 9:30 - 12: Hack &lt;br/&gt; 12-1: Food/networking/hacking &lt;br/&gt; 1-5(?): Hack &lt;br/&gt; 5-6 (no firm cut off time): Share what you learned to the group over pizza and drinks.  Demo if you have one.  &lt;/p&gt;
&lt;p&gt;How you can help:&lt;/p&gt;
&lt;p&gt;-  Help us get data sets organized and a Chef/Puppet recipe setup with all  the appropriate tools/languages/SCM/etc.  Also, think of interesting  problems to work on. &lt;br/&gt; - Sponsor food/coffee/drinks/t-shirts/compute time/ etc.  Please contact Grant Ingersoll at info@trihug.org.   I don&amp;#8217;t think we are talking about a super lot of money here (maybe  $1000-1500 total?  &amp;#8212; more on this as things develop) &lt;br/&gt; - Let us know  you are interested, the more we hear from sooner, the better we can  plan space accordingly.  Please reply on this list if you are  interested. &lt;br/&gt; - Are you graphically capable?  Help us design a t-shirt. &lt;br/&gt; - Once we firm up some details, help us spread the word&lt;/p&gt;</description><link>http://www.trihug.org/post/5402118219</link><guid>http://www.trihug.org/post/5402118219</guid><pubDate>Wed, 11 May 2011 18:29:16 -0400</pubDate><category>hackathon</category><category>hadoop</category><category>cassandra</category><category>mahout</category><category>hive</category><category>pig</category><category>scaling</category><category>hackathon</category><category>raleigh</category><category>durham</category><category>chapel hill</category><category>NC</category></item><item><title>Next TriHUG: Monday April 4th</title><description>&lt;p&gt;Out next meeting will be &lt;strong&gt;Monday, April 4th&lt;/strong&gt; at Bronto. Food and drinks at &lt;strong&gt;6:30pm&lt;/strong&gt;. Talk starts at 7:00pm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;*** Note: this is a Monday, not our usual Tuesday ***&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Title:&lt;/strong&gt; Starfish: A Self-tuning System for Big Data Analytics&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Presenter:&lt;/strong&gt; &lt;a href="http://www.cs.duke.edu/~shivnath/"&gt;Shivnath Babu&lt;/a&gt;, Duke University&lt;/p&gt;
&lt;p&gt;Shivnath Babu, assistant professor of Computer Science at Duke University, will help demystify Hadoop performance tuning. Practical tips on tuning Hadoop for specific workloads will be discussed. Details will be provided on the the Starfish research project: a system for self-tuning big-data analytics. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.eventbrite.com/event/1470844335"&gt;RSVP HERE&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/4010614430</link><guid>http://www.trihug.org/post/4010614430</guid><pubDate>Mon, 21 Mar 2011 18:41:30 -0400</pubDate></item><item><title>February 2011 Meeting - Apache Mahout: Driving the Yellow Elephant</title><description>&lt;p&gt;Out next meeting will be Tuesday February 1st at Bronto. Food and drinks at 6:30pm. Talk starts at 7:00pm.&lt;/p&gt;
&lt;p&gt;&lt;img src="http://media.tumblr.com/tumblr_lfazk7R2xw1qb45ba.png"/&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Apache Mahout: Driving the Yellow Elephant&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://mahout.apache.org/"&gt;Apache Mahout&lt;/a&gt; co-founder and committer, &lt;a href="http://www.grantingersoll.com/"&gt;Grant Ingersoll&lt;/a&gt; will give an introduction to Apache Mahout and machine learning.  We will also spend some time looking at how Mahout leverages Apache Hadoop to implement a scalable clustering algorithm.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.eventbrite.com/event/1227255755"&gt;REGISTER HERE&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Help spread the word! Print out this flyer and post it at your office, school, local coffee shop.&lt;/p&gt;
&lt;p&gt;&lt;a href="http://dl.dropbox.com/u/177932/feb%202011%20trihug%20flyer.pdf"&gt;&lt;img src="http://media.tumblr.com/tumblr_lfbsemXZNV1qb45ba.png"/&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;</description><link>http://www.trihug.org/post/2837190533</link><guid>http://www.trihug.org/post/2837190533</guid><pubDate>Wed, 19 Jan 2011 23:10:00 -0500</pubDate></item><item><title>December 2010 Meeting Followup</title><description>&lt;p&gt;&lt;img src="http://media.tumblr.com/tumblr_ldmh9byhTS1qb45ba.jpg"/&gt;&lt;/p&gt;
&lt;p&gt;We closed out the year with a talk by Brian O&amp;#8217;Connor ( seen above at the MacBook Air ). He outlined how UNC Lineberger Comprehensive Cancer Center is using Hadoop and HBase in research. Once again &lt;a href="http://www.bronto.com"&gt;Bronto&lt;/a&gt; graciously hosted us and provided food and drinks. Slides for Brian&amp;#8217;s talk and the New and Noteworthy segment are below. &lt;/p&gt;
&lt;p&gt;2010 was a good year for Hadoop in the Triangle. Looking forward to 2011.&lt;/p&gt;
&lt;p&gt;-&lt;a href="http://www.twitter.com/ryancox"&gt;ryan&lt;/a&gt;&lt;/p&gt;






&lt;p&gt;
&lt;object id="__sse6216018" width="425" height="355"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=20101207oconnortrihughbasetalk-101217143914-phpapp01&amp;amp;rel=0&amp;amp;stripped_title=20101207-o-connortrihughbasetalk&amp;amp;userName=ryancox"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6216018" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=20101207oconnortrihughbasetalk-101217143914-phpapp01&amp;amp;rel=0&amp;amp;stripped_title=20101207-o-connortrihughbasetalk&amp;amp;userName=ryancox" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;
&lt;object id="__sse6078191" width="425" height="355"&gt;
&lt;param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=december2010trihugnews-key-101208085614-phpapp02&amp;amp;stripped_title=hadoop-new-and-note-december-2010-trihug&amp;amp;userName=ryancox"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed name="__sse6078191" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=december2010trihugnews-key-101208085614-phpapp02&amp;amp;stripped_title=hadoop-new-and-note-december-2010-trihug&amp;amp;userName=ryancox" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;
&lt;/p&gt;
&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;&lt;script src="http://b.scorecardresearch.com/beacon.js?c1=7&amp;amp;c2=7400849&amp;amp;c3=1&amp;amp;c4=&amp;amp;c5=&amp;amp;c6="&gt;&lt;/script&gt;</description><link>http://www.trihug.org/post/2359249957</link><guid>http://www.trihug.org/post/2359249957</guid><pubDate>Sat, 18 Dec 2010 07:08:17 -0500</pubDate><category>hadoop</category><category>hbase</category><category>trihug</category></item><item><title>December TriHUG Meeting</title><description>&lt;p&gt;&lt;br/&gt;We will be talking &lt;a href="http://hbase.apache.org/"&gt;HBase&lt;/a&gt; and biotech. Come join us at &lt;strong&gt;&lt;span mce_fixed="1" mce_style="font-weight: bold;" mce_name="strong"&gt;6:30pm&lt;/span&gt;&lt;/strong&gt; on &lt;strong&gt;&lt;span mce_fixed="1" mce_style="font-weight: bold;" mce_name="strong"&gt;Tuesday December 7th&lt;/span&gt;&lt;/strong&gt; at &lt;a href="http://www.bronto.com/"&gt;Bronto&lt;/a&gt;. Food and drinks at 6:30pm. Talk starts at 7:00pm.&lt;/p&gt;
&lt;p class="p1"&gt;&lt;span class="s1"&gt;&lt;strong&gt;&lt;span mce_fixed="1" mce_style="font-weight: bold;" mce_name="strong"&gt;Brian O’Connor&lt;/span&gt; &lt;/strong&gt;from the &lt;a href="http://www.unclineberger.org/"&gt;UNC Lineberger Comprehensive Cancer Center&lt;/a&gt; will discuss their work using HBase and Hadoop MapReduce to store and query information from large cancer resequencing projects.  He will provide an overview of HBase along with the problems they are working on. The relative merits of the technology will be explored in addition to alternative approaches. &lt;/span&gt;&lt;/p&gt;
&lt;p class="p1"&gt;&lt;span class="s1"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class="p1"&gt;&lt;strong&gt;&lt;a href="http://www.eventbrite.com/event/1031692821"&gt;REGISTER HERE&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p class="p1"&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p class="p1"&gt;More info:&lt;/p&gt;
&lt;ul&gt;&lt;li mce_style="color: #000000;"&gt;&lt;a href="http://seqware.sf.net/"&gt;SeqWare&lt;/a&gt;&lt;/li&gt;
&lt;li mce_style="color: #000000;"&gt;&lt;a href="http://vimeo.com/16350544"&gt;HBase in Production at Facebook - Video&lt;/a&gt;&lt;/li&gt;
&lt;li mce_style="color: #000000;"&gt;&lt;a href="http://vimeo.com/14068552"&gt;Intro to HBase - Video&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://mitworld.mit.edu/video/386"&gt;Introduction to Cancer Genetics - Video&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;Help us spread the word on Twitter and Facebook or by printing out this flyer and posting it at your university or office.&lt;/p&gt;

&lt;p&gt;&lt;a href="http://dl.dropbox.com/u/177932/dec%202010%20trihug%20flyer.pdf"&gt;&lt;img src="http://media.tumblr.com/tumblr_lbmc0nycuy1qb45ba.png"/&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;&lt;/ul&gt;</description><link>http://www.trihug.org/post/1524729445</link><guid>http://www.trihug.org/post/1524729445</guid><pubDate>Tue, 09 Nov 2010 07:39:00 -0500</pubDate></item></channel></rss>

