Solr vs. ElasticSearch: Part 1 – Overview
Good Solr vs. ElasticSearch coverage is long overdue. We make good use of our own Search Analytics and pay attention to what people search for. Not surprisingly, lots of people are wondering when...
Solr vs. ElasticSearch: Part 2 – Data Handling
In the previous part of the Solr vs. ElasticSearch series we talked about the general architecture of these two great search engines based on Apache Lucene. Today, we will look at their ability to handle your...
Locating Mountains and More with Mahout and Public Weather Dataset
Recently I was playing with Mahout and a public weather dataset. In this post I will describe how I used the Mahout library and weather statistics to fill gaps in weather measurements and how I...
Battle of the Giants: Apache Solr 4.0 vs ElasticSearch
The Apache Solr 4.0 release is imminent and we have a heavily anticipated Solr vs. ElasticSearch blog post series going on. What better time to share that our Rafał Kuć will be giving a talk titled Battle...
New Tool: JMXC – JMX Console
When you are obsessed with performance and run a performance monitoring service like Sematext does, you need a quick and easy way to inspect Java apps’ MBeans in JMX. We just open-sourced JMXC, our...
Solr vs ElasticSearch: Part 3 – Searching
In the last two parts of the series we looked at the general architecture and how data can be handled in both Apache Solr 4 (aka SolrCloud) and ElasticSearch and what the language handling capabilities...
Solr vs ElasticSearch: Part 4 – Faceting
Solr 4 (aka SolrCloud) has just been released, so it’s the perfect time to continue our ElasticSearch vs. Solr series. In the last three parts of the ElasticSearch vs. Solr series we gave a general...
Slides: Battle of the Giants – Solr 4.0 vs ElasticSearch 0.20.0
Slides for the Battle of the Giants talk Rafał Kuć (@kucrafal) gave at ApacheCon EU 2012 are now up! If you like working with Solr and/or ElasticSearch, or HBase, Hadoop, Kafka, Flume, etc., use and/or...
SPM Discountorama Announcement
We are happy to announce the General Availability of SPM, our performance monitoring solution for Apache Solr, ElasticSearch, HBase, SenseiDB, and Java applications, and of course all system metrics....
HBaseWD and HBaseHUT: Handy HBase Libraries Available in Public Maven Repo
HBaseWD aims to help distribute writes of records with sequential row keys in HBase (and avoid RegionServer hotspotting). A good introduction can be found here. We recently published version 0.1.0 of...
Solr vs ElasticSearch: Part 5 – Management API Capabilities
In previous posts, all listed below, we’ve discussed general architecture, full text search capabilities and facet aggregation possibilities. However, until now we have not discussed any of the...
Solr vs. ElasticSearch: Part 6 – User & Dev Communities
One of the questions after my talk during the recent ApacheCon EU was what I thought about the communities of the two search engines I was comparing. Not surprisingly, this is also a question we often...
Poll: Which Solr version are you using?
With Solr 4.1 recently released, let’s see which version(s) of Solr people are using. Please tweet it to help us get more votes and better stats.
Poll: Using SolrCloud or Not?
We know that as of February 2013, of those Solr users who follow the Sematext Blog, about 75% use some version of Solr 4.x. But today we are trying to get to another interesting stat: What portion of...
Marketing Intern Position Available
We’ve been very busy at Sematext working on our flagship products – SPM, Search Analytics, and …. more (we’ll be announcing something new soon). We’ve received great positive feedback from users and...
EC2 Neighbour Caught Stealing CPU
We run all our services on top of AWS. We like the flexibility and the speed of provisioning and decommissioning instances. Unfortunately, this “new age” computing comes at a price. Once in a while...
Sneak Peek: Hadoop Monitoring comes to SPM
When it comes to Hadoop, they say you’ve got to monitor it and then monitor it some more. Since our own Performance Monitoring and Search Analytics services run on top of Hadoop, we figured it was...
What’s New in SPM 1.11.0
We’ve been doing quite a bit of work behind the scenes in SPM. Here are a few new things in the most recent release – 1.11.0 from April 16, 2013: We’ve added a Standalone Monitor. So far the only way...
Berlin Buzzwords 2013 – Two Talks from Sematext
Last year at Berlin Buzzwords we were proud to give three talks. Alex talked about “Real-time Analytics with HBase” (slides, video), Otis talked about large scale monitoring in his talk titled “Large...
Poll Results: Hadoop YARN vs. pre-YARN
Back in April 2013 there was a poll in the Hadoop Users LinkedIn group: YARN or pre-YARN – which version of Hadoop are you using? Because we were working on adding Hadoop monitoring to SPM, this was an...