Apache Solr 6.6.5 ¼³Ä¡ (Windows10)
1. ¸ÕÀú Apache Solr ÆÄÀÏÀ» ´Ù¿î·Îµå ¹Þ´Â´Ù.
http://apache.mirror.cdnetworks.com/lucene/solr/6.6.5/solr-6.6.5.zip
2. ¾ÐÃàÀ» Ç®°í ¿øÇÏ´Â Æú´õ¿¡ Ç®·ÁÁø Æú´õ¸¦ À§Ä¡½ÃŲ´Ù. (º»ÀÎÀº D:\Program\ Æú´õ ¾Æ·¡¿¡ º¹»çÇϵµ·Ï ÇÏ°ÚÀ½)
3. CORE »ý¼º
: »ý¼ºÇÏ´Â ÀÌÀ¯´Â Àڱ⸸ÀÇ À妽º, Çʵ弳Á¤ µîÀ» Çϱâ À§ÇÔÀÔ´Ï´Ù.
1) ¸ÕÀú solr ÇÏÀ§ÀÇ bin Æú´õ¿¡ µé¾î°©´Ï´Ù.
==> solr.cmd ¸í·É¾î¸¦ »ç¿ëÇϱâ À§Çؼ...(±ÍÂúÀ¸¸é ȯ°æ¼³Á¤ path¿¡ bin °æ·Î¸¦ µî·ÏÇϼŵµ µÇ±¸¿ä)
2) ÇØ´ç °æ·Î¿¡¼ cmdâÀ» ¿¾î¼ ¾Æ·¡ Solr¼¹ö ½ÃÀÛ+CORE ¸í·ÉÀ» ½ÇÇàÇÑ´Ù.
- ¼¹ö½ÃÀÛ : solr start
- ºê¶ó¿ìÀú ÁÖ¼Òâ¿¡ localhost:8983 ÀԷ½à ¾Æ·¡¿Í °°ÀÌ ³ª¿Â´Ù.
- CORE»ý¼º : solr create -c [¸¸µé°íÀÚ ÇÏ´Â À̸§ ¾Æ¹«°Å³ª]
¿¹ : solr create -c dasdes
- CORE»ý¼º ÈÄ ºê¶ó¿ìÀúÀÇ localhost:8983À» »õ·Î°íħÇÏ¸é ¾Æ·¡¿Í °°ÀÌ COREºÎºÐ¿¡ ¼±ÅÃÇÒ ¼ö ÀÖ´Â ÄÞº¸¹Ú½º°¡ ³ª¿Â´Ù.
3) »ý¼ºµÈ Æú´õ È®ÀÎ
D:\Program\solr-6.6.5\server\dasdes ¶ó´Â Æú´õ°¡ »õ·Î »ý°å´Ù.
4. ³×À̹öÄ«ÆäÀÇ ·ç¾À Çѱۺм®±â ¿ÀǼҽº ÇÁ·ÎÁ§Æ®¿¡¼ ÇѱÛÈ °ü·ÃµÈ ¶óÀ̺귯¸®¸¦ ´Ù¿î·Îµå ¹Þ´Â´Ù
: ³Ê¹« °¨»çÇÑ ºÐµéÀÌ ¿½ÉÈ÷ ÀÛ¾÷Çؼ ¿Ã·ÁÁֽñ⠶§¹®¿¡ Ç×»ó °¨»çÇÑ ¸¶À½À» °¡Áö°í »ç¿ëÇØ¾ß ÇÑ´Ù.
Ä«ÆäÁÖ¼Ò : https://cafe.naver.com/korlucene (´Ù¿î·Îµå¸¦ À§Çؼ´Â ȸ¿ø°¡ÀÔÀ» ÇؾßÇÑ´Ù.)
Àú´Â ÃÖ±Ù "¼ö¸í´Ô"²²¼ jarÆÄÀÏ·Î ÀÛ¾÷ÇϽŠÆÄÀÏ µÎ°³¸¦ ´Ù¿î·ÎµåÇÑ´Ù.
arirang.lucene-analyzer-7.2.1.1.jar, arirang-morph-1.1.4.jar
±×·±´ÙÀ½ ´Ù¿î·Îµå ÇÑ µÎ ÆÄÀÏÀ» D:\Program\solr-6.6.5\server\solr-webapp\webapp\WEB-INF\lib Æú´õ¿¡ ³Ö´Â´Ù.
5. managed-schema ÆÄÀÏÀ» ¼öÁ¤ÇÑ´Ù.
- ÆÄÀÏÀ§Ä¡ : D:\Program\solr-6.6.5\server\solr\dasdes\conf\managed-schema
- ÆíÁý±â·Î ¿¾î ½ºÅ©·ÑÀ» Á¦ÀÏ ¾Æ·¡·Î ³»¸°´Ù.
ÇÏ´ÜÀÇ </schema> űװ¡ Á¾·áµÇ´Â ºÎºÐÀÌ Àִµ¥ ±× ¾Õ¿¡ ¾Æ·¡ ¼³Á¤À» ºÙ¿©³Ö´Â´Ù.
<dynamicField name="*_txt_kr" type="txt_kr" indexed="true" stored="true"/> <fieldType name="txt_kr" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="org.apache.lucene.analysis.ko.KoreanTokenizerFactory"/> <!--·ç¾ÀÀÇ Factory class --> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.ClassicFilterFactory"/> <filter class="org.apache.lucene.analysis.ko.KoreanFilterFactory" hasOrigin="true" hasCNoun="true" bigrammable="false"/> <filter class="org.apache.lucene.analysis.ko.HanjaMappingFilterFactory"/> <filter class="org.apache.lucene.analysis.ko.PunctuationDelimitFilterFactory"/> <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/> <filter class="solr.StopFilterFactory" words="lang/stopwords_kr.txt" ignoreCase="true"/> </analyzer> <analyzer type="query"> <tokenizer class="org.apache.lucene.analysis.ko.KoreanTokenizerFactory"/> <filter class="solr.SynonymGraphFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> <filter class="solr.StopFilterFactory" words="lang/stopwords_kr.txt" ignoreCase="true"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.ClassicFilterFactory"/> <filter class="org.apache.lucene.analysis.ko.KoreanFilterFactory" hasOrigin="true" hasCNoun="true" bigrammable="true"/> <filter class="org.apache.lucene.analysis.ko.WordSegmentFilterFactory" hasOrijin="true"/> <filter class="org.apache.lucene.analysis.ko.HanjaMappingFilterFactory"/> <filter class="org.apache.lucene.analysis.ko.PunctuationDelimitFilterFactory"/> <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/> </analyzer> </fieldType> |
- ±âº» Core »ý¼º½Ã stopwords_kr.txtÆÄÀÏÀÌ ¾ø´Âµ¥ D:\Program\solr-6.6.5\server\solr\dasdes\conf\lang ÇÏÀ§¿¡ »ý¼ºÇÏÀÚ.
- À§ ¼³Á¤¿¡ º¸ÀÌ´Â txt ÆÄÀÏ¿¡ ´ëÇÑ ¼³¸í
1) protwords.txt : ÇÕ¼º¾î °°ÀÌ ÂÉ°³Á®¼´Â ¾ÈµÇ´Â ´Ü¾îµéÀ» Àû¾î³õ´Â´Ù.
2) lang/stopwords_kr.txt : ºó¹øÈ÷ »ç¿ëµÇ´Â ´Ü¾î¿Í ¹®ÀÚ¸¦ ÁßÁö ´Ü¾î¶ó°í ¸»ÇÏ¸ç °Ë»ö Å°¿öµå¿¡¼ ÀÚµ¿À¸·Î Á¦¿ÜµÊ.
(ÀÎÅÍ³Ý °Ë»ö½Ã Ä£ÀýÇÏ°Ô Çѱ¹¾î ´Ü¾î¸¦ Á¤¸®ÇسõÀº °÷ÀÌ ¸¹ÀÌ ÀÖÀ½)
3) synonyms.txt : ¸ÂÃã¹ýÀ» ¼öÁ¤ÇϱâÀ§ÇÑ µ¿ÀÇ¾î ¼³Á¤ (¿¹ : teh -> the )
6. ¼³Á¤ÀÌ Àß Àû¿ëµÇ¾ú´ÂÁö solr ¼¹ö¸¦ restartÇغ»´Ù.
- solr stop -all
- solr start
¡Ø ´ÙÀ½½Ã°£¿¡´Â mariadb¿Í ¿¬µ¿ÇÏ¿© À妽º¸¦ °¡Á®¿À´Â°ÍÀ» Çغ¼ ¿¹Á¤ÀÌ´Ù.
Ãâó: https://dodo-it.tistory.com/33?category=749271 [ÀÌ°ÍÀú°Í Çغ¸±â]