Using Solr in mixi.

#SolrJP 8th.

mixi, Inc.

System H.Q., Tech Div., R & D Group.

HARUYAMA Seigo(@haruyama)

日本語 / English

Self Introduction

Using Solr in mixi.

Full-Text Search in mixi

We adopted following solutions before 2011 (and now use them).

we have been constructing and renewing search systems using Solr, since 2011.

Anuenue

Our open-source product(Apache Licence 2.0).

a Solr wrapper, which can build Master-Slave Full-Text Search System.

Anuenue : Logical composition

Anuenue: History

We use Anuenue 0.7.x in mixi.

Search Systems using Anuenue in mixi

We also use Anuenue for search systems in our office.

Anuenue in mixi : Physical composition

We use 1 Master and 2 Slave(and Merger) servers on our most systems.

Features of our typical search system.

And we share physical servers with 2 or 3 services.

We are going to build bigger systems (index size: over 100GB).

Physical composition: Figure

Future goal

Custom Function Query which retrieves data from external storage

Custom Function Query: Figure

Custom Function Query: Demo

Demo on my machine.

Text mining using Solr

Number of documents and Index size

We keep indexes for the last 102 days in 600GB SSD.

Softwares

Fields in Document

Sample: number of posts on 2012/05

number of women’s posts on 2012/05

number of men’s posts on 2012/05

facet of age to women’s posts on 2012/05

facet of age to men’s posts on 2012/05

facet of age to women’s posts including ‘AKB’ on 2012/05

facet of age to men’s posts including ‘AKB’ on 2012/05

Composition

Why is index for last 1-day separated?

Expansion

Positive/Negative analysis

Positive/Negative Analysis: Score Ranking

Burst Keyword

Burst Keyword: 3/23

burst keyword: 5/21

Future goal

Summary