Webinar: Solr To The Rescue: Successful Migration From FAST ESP to Open Source Search Based on Apache Solr.

On Nov 18, 2010, 14:00 EST (19:00 GMT),  join us for the Webinar, organized by Lucid Imagination:  Solr To The Rescue: Successful Migration From FAST ESP to Open Source Search Based on Apache Solr.

Michael McIntosh, VP Search Solutions, TNR Global will be on the webinar panel along with Eric Gaumer, Chief Architect, ESR Technology, and Helge Legernes, Founding Partner & CTO, Findwise discussing approaches for migrations from FAST ESP to Open Source Search based on Apache Solr.

Webinar description:

Users of FAST ESP have become increasingly concerned since the purchase of Fast Search and Transfer by Microsoft in 2008. The discontinuation of the Linux platform and the subordination of FAST features into Sharepoint have created a greater sense of urgency to seek out alternatives. Many FAST ESP users are considering open source enterprise search based on Apache Solr as an option. This roundtable discussion will provide valuable insights for search users looking to make the change to Solr.

The panel covers factors driving the need for a FAST ESP alternatives, differences between FAST and Solr, typical migration project lifecycle & methodology, complementary open source tools, migration pro’s & con’s, best practices and customer examples, and recommended next steps.

Register

Resilient Businesses Summit

The Resilient Business Summit, held on November 13, 2010, was spear-headed by Rich Roth, CEO of TNR Global. Hidden Tech and TNR Global co-sponsored the summit. Small business owners attended the summit and participate din pro-active dialogue on topics ranging from economic stability, current and future policies, best practices, and program support, all based around the theme of creating and sustaining Resilient Small Businesses.

Migration from FAST ESP to Lucene Solr

Download the presentation and see the video.

Michael McIntosh, Vice President of Enterprise Search Technologies at TNR, spoke at the Lucene Revolution conference in Boston, MA October 7-8, 2010. Michael reviewed the migration from Fast ESP to Lucene/Solr open source search. He discussed approaches to identifying core content areas of HTML documents such as Text-To-Tag Ratio Heuristics and Page Stereotype/Site Template Analysis, and reviewed specific use cases that we have encountered as search integration experts and discuss available tools.

TNR Global was a sponsor of Lucene Revolution. The conference gathered over 400 professionals from the enterprise search industry. We were happy to see so much interest in Lucene/Solr open source search, and get to know and learn from the folks who have done large scale implementations, including Twitter, LinkedIn, and eHarmony.  Not surprisingly, there was a lot of interest about migration from proprietory search systems to Solr, especially from FAST ESP due to Microsoft’s discontinuing FAST ESP support for Linux.  If you would like to learn more about how a migration from FAST ESP to Lucene Solr can benefit your company, contact us for a free consultation.

Migration from FAST ESP to Lucene Solr – Presentation

Download the presentation and see the video.

Michael McIntosh, Vice President of Enterprise Search Technologies at TNR, spoke at the Lucene Revolution conference in Boston, MA October 7-8, 2010. Michael reviewed the migration from Fast ESP to Lucene/Solr open source search. He discussed approaches to identifying core content areas of HTML documents such as Text-To-Tag Ratio Heuristics and Page Stereotype/Site Template Analysis, and reviewed specific use cases that we have encountered as search integration experts and discuss available tools.

from-fast-esp-to-solr

Apache Solr is a winner of this year’s Infoworld BOSSIE Award

SolrApache Solr is a winner of this year’s Infoworld BOSSIE Award for the best open source applications. Other winners included Alfresco, WordPress, and Drupal – so Solr is in good company. This award once again indicates that Solr has proved itself to be a robust search application.

http://www.infoworld.com/d/open-source/bossie-awards-2010-the-best-open-source-applications-150&current=10&last=1#slideshowTop

Michael McIntosh, TNR Global, speaking at Lucene Revolution – The First Conference Dedicated to Open Source Enterprise Search

mmphotoMichael McIntosh, Vice President of Enterprise Search Technologies at TNR, will be presenting at the Lucene Revolution conference in Boston, MA October 7-8, 2010.

Michael will review the migration from commercial search platforms (focusing on Fast ESP) to Lucene/Solr open source search. He will discuss approaches to identifying core content areas of HTML documents such as Text-To-Tag Ratio Heuristics and Page Stereotype/Site Template Analysis, and will review specific use cases that we have encountered as search integration experts and discuss available tools.

For more information see:
http://lucenerevolution.com/speakers-bios#McIntosh

New Website for Innovara, Inc.

TNR Global Joomla! Services launched the new Innovara, Inc. site this summer. Innovara, with headquarters in Hadley, MA and with a global reach, provides highly customized training, medical thought leadership, and business development services for the healthcare, pharmaceutical, biotechnology, and medical device industries.
The new Innovara website was created with the Joomla! Content Management System. Highlights include a custom design provided by Meg McCarthy Design, document management, multilingual home pages, and the implementation of advanced access control list capabilities, to create custom groups of staff and clients who can access and add content to specific areas of the website.

Intermediate Joomla! Workshop held August 5th, 2010 in Hadley, MA

Tamar Schanfeld of TNR Global led an all day Intermediate Joomla! Workshop on August 5th, 2010 in Hadley, MA.  Topics included: backups, security, search engine optimization, social networking, blogs, and more.

Participant feedback was very positive!   “Lots of info – very pleased,” “Exceeded expectations,” “A very patient instructor”.

Interested in taking a Joomla! workshop? Email us at info@tnrglobal.com and we’ll notify you of upcoming Joomla! classes and events. TNR Global also provides online training and small group training for businesses. Contact us today!

For those who were not able to attend – TNR will be posting a Joomla! tip of the day on our TNR Joomla! Blog.

TNR Global to present at Lucene Revolution Conference

Lucene Revolution Conference Logo
TNR Global has been selected to present at the Lucene Revolution conference in Boston, MA October 7-8, 2010. Michael McIntosh, Vice President of Enterprise Search Technologies at TNR, will speak on Friday, October 8th regarding the migration from commercial search platforms (focusing on Fast ESP) to Lucene/Solr open source search. He will discuss approaches to identifying core content areas of HTML documents such as Text-To-Tag Ratio Heuristics and Page Stereotype / Site Template Analysis, will review specific use cases that we have encountered as search integration experts and discuss available tools.

Lucene Revolution is the first conference dedicated to open source search in North America. The two-day conference is packed with technical sessions, developer content, user case studies, panels, and networking opportunities. Attendees will learn new ways to develop, deploy, and enhance search applications using Lucene/Solr. For more information, and to register, visit http://www.lucenerevolution.com.

Dynamic Fields in Apache Solr

So, you’ve installed a fresh copy of Apache Solr. You have tested it out running the examples from the Solr tutorial. And now you are ready to start indexing some of your own data. Just one problem. The fields for your data are not recognized by the default Solr instance. You notice in the schema.xml file that the default fields have names like cat, weight, subject, includes, author, title, payloads, popularity, price, etc. These fields are defined for the purpose of being used with the sample data provided with Solr. Most of their names are likely not relevant to your dataset, and even if you can manage to make things “fit” with misnamed fields even just for the purpose of experimenting, you also face the problem that their set properties may not be what you would expect them to be.

Of course you can modify the schema.xml file and apply strong data-typing to each field that you plan to use to fit the exact needs of your project, reload Solr, and then start to index your data. But if you are just getting started with Solr, or starting a new project and experimenting with adding to your dataset, you may not know exactly what fields you need to define or what properties to define for them. Or you might be interested updating an existing index with some additional fields, but do not want to explicitly add them to the schema.

Fortunately, Solr gives the option to define dynamic fields. Further, there are pre-defined dynamic fields for many of the common data-types in the default schema. Here are the some of the dynamic fields that are found in the default schema.xml:

<dynamicField name="*_i"  type="int"    indexed="true"  stored="true"/>
<dynamicField name="*_s"  type="string"  indexed="true"  stored="true"/>
<dynamicField name="*_l"  type="long"   indexed="true"  stored="true"/>
<dynamicField name="*_t"  type="text"    indexed="true"  stored="true"/>
<dynamicField name="*_b"  type="boolean" indexed="true"  stored="true"/>
<dynamicField name="*_f"  type="float"  indexed="true"  stored="true"/>
<dynamicField name="*_d"  type="double" indexed="true"  stored="true"/>
<dynamicField name="*_dt" type="date"    indexed="true"  stored="true"/>

The field names are defined with a glob-like pattern that is either at the beginning or end of the name. With the above dynamic fields, you can index data with field names that begin with any valid string and end in one of the suffixes in the name attributes (i.e. article_title_s, article_content_t, posted_date_dt, etc.) and Solr will dynamically create any dynamic field of the particular type with the name that you give it.

<add>
<doc>
<field name="article_title_s">My Article</field>
<field name="article_content_t">Lorem Ipsum...</field>
<field name="posted_date_dt">1995-12-31T23:59:59Z</field>
</doc>
</add>

After you’ve indexed some data, you can actually view the dynamic field names in the schema viewer, located at http://YOUR-INSTANCE/admin/schema.jsp

Using dynamic fields is a great way to get started at using Apache Solr with minimal setup.