/ /

/ /

標準アナライザ

standardアナライザは、すべてのMongoDB Search インデックスとクエリのデフォルトです。テキストを単語の境界に基づいてタームに分割するため、ほとんどのユースケースで言語に依存しません。すべてのタームが小文字に変換され、句読点が削除されます。これは、メールアドレス、頭字語、中国語、日が照合、文字、英数字などを認識する文字列ベースのトークン化を提供します。

Refine Your Index を選択した場合、Atlas UI はIndex Configurations セクション内に View text analysis of your selected index configuration というタイトルのセクションを表示します。このセクションを展開すると、Atlas UI には、standardアナライザが各サンプル列に対して生成するインデックスと検索トークンが表示されます。 Atlas UI Visual Editor でインデックスを作成または編集すると、standardアナライザが組み込みサンプルドキュメントとクエリ文字列用に作成するトークンが確認できます。

重要

MongoDB Search は、アナライザトークンのサイズが 32766 バイトを超える、string フィールドのインデックスません。キーワードアナライザを使用している場合、32766 バイトを超える string フィールドはインデックス化されません。

例

次のインデックス定義の例では、 standardアナライザを使用して、sample_mflix.moviesコレクションの titleフィールドのインデックスを指定します。この例に従うには、クラスターにサンプルデータをロードし、mongosh を使用するか、MongoDB 検索インデックスの作成チュートリアルの手順に従って Atlas UI の Create a Search Index ページに移動します。

次に、movies コレクションをデータソースとして使用し、または Atlasmongosh UIVisual Editor またはJSON editor からインデックスを作成する例の手順に従います。

➤ 言語を選択 ドロップダウンメニューを使用して、このページの例のインターフェイスを設定します。

インデックスを設定するには、 Refine Your Indexをクリックします。
Index Configurations セクションで、Dynamic Mapping を off に切り替えます。
Field Mappingsセクションで、 Add Field MappingをクリックしてAdd Field Mappingウィンドウを開きます。
[Customized Configuration] をクリックします。
Field Nameドロップダウンからtitleを選択します。
[ Data Type String選択されていない場合は選択します。

String Propertiesを展開し、次の変更を加えます。

インデックスアナライザ	まだ選択されていない場合は、ドロップダウンから [ `lucene.standard` ] を選択します。
searchAnalyzer	まだ選択されていない場合は、ドロップダウンから [ `lucene.standard` ] を選択します。
インデックスオプション	デフォルトの`offsets`を使用します。
Store	デフォルトの`true`を使用します。
上記を無視	デフォルト設定のままにしてください。
基準	デフォルトの`include`を使用します。

[Add] をクリックします。
[Save Changes] をクリックします。
[Create Search Index] をクリックします。

デフォルトのインデックス定義を、以下のインデックス定義で置き換えます。

{
  "mappings": {
    "fields": {
      "title": {
        "type": "string",
        "analyzer": "lucene.standard"
      }
    }
  }
}

[Next] をクリックします。
[Create Search Index] をクリックします。

1 db.movies.createSearchIndex(
2   "default",
3     {
4       "mappings": {
5         "fields": {
6           "title": {
7             "type": "string",
8             "analyzer": "lucene.standard"
9           }
10         }
11       }
12     }
13 )

次のクエリは、 titleフィールドでactionというタームを検索し、出力を 2 つの結果に制限します。

インデックスの Query ボタンをクリックします。
クエリを編集するには、Edit Query をクリックします。
クエリバーをクリックし、データベースとコレクションを選択します。

デフォルトのクエリを以下のように置き換え、Find をクリックします。

[
  {
    "$search": {
      "text": {
        "query": "action",
        "path": "title"
      }
    }
  }
]

SCORE: 4.001246452331543  _id: "573a1398f29313caabceac7f"
  awards: Object
  cast: Array (4)
  countries: Array (1)
  directors: Array (1)
  fullplot: "Jericho "Action" Jackson is a Detroit police sergeant who was demoted …"
  genres: Array (3)
  imdb: Object
  languages: Array (1)
  lastupdated: "2015-08-20 00:24:08.470000000"
  num_mflix_comments: 2
  plot: "Vengence drives a tough Detroit cop to stay on the trail of a power hu…"
  poster: "https://m.media-amazon.com/images/M/MV5BZWFhNmI3OWQtOTU5Zi00ODA3LWExNj…"
  rated: "R"
  released: 1988-02-12T00:00:00.000+00:00
  runtime: 96
  title: "Action Jackson"
  tomatoes: Object
  type: "movie"
  writers: Array (1)
  year: 1988
 SCORE: 4.001246452331543  _id:  "573a1399f29313caabcec6b7"
   awards: Object
   cast: Array (4)
   countries: Array (1)
   directors: Array (1)
   fullplot: "Jeb Ward is an attorney who specializes in whistle blower, David vs. Goliath cases. When a major auto manufacturer is accused of knowingly marketing a defective car, Ward takes on the case, risking everything."
   genres: Array (2)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-09-06 00:42:51.167000000"
   metacritic: 58
   num_mflix_comments: 2
   plot: "Jeb Ward is an attorney who specializes in whistle blower, David vs. Goliath cases."
   poster: "https://m.media-amazon.com/images/M/MV5BNWY5Mjk4ZmItMTAzYS00NWE3LWEzYz…"
   rated: "R"
   released: 1991-03-15T00:00:00.000+00:00
   runtime: 110
   title: "Class Action"
   tomatoes: Object
   type: "movie"
   writers: Array (3)
   year: 1991
 SCORE: 4.001246452331543  _id:  "573a13b0f29313caabd333e7"
   awards: Object
   cast: Array (4)
   countries: Array (2)
   directors: Array (1)
   fullplot: "Vishi is a local goon who basically likes beating up people to get his way. He falls in love with Khushi, but his life takes a turn when he crosses paths with a ruthless mafia kingpin."
   genres: Array (3)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-09-10 17:20:55.697000000"
   num_mflix_comments: 1
   plot: "In order to escape crime and clutches of a ruthless mafia, a Bangkok based goon seeks help from his lookalike, a Mumbai based criminal."
   released: 2014-12-05T00:00:00.000+00:00
   runtime: 144
   title: "Action Jackson"
   tomatoes: Object
   type: "movie"
   writers: Array (3)
   year: 2014
 SCORE: 4.001246452331543  _id:  "573a13d2f29313caabd913dc"
   awards: Object
   countries: Array (1)
   directors: Array (1)
   genres: Array (2)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-06-15 03:15:33.967000000"
   num_mflix_comments: 0
   released: 2011-01-28T00:00:00.000+00:00
   runtime: 45
   title: "Slow Action"
   tomatoes: Object
   type: "movie"
   writers: Array (1)
   year: 2011
 SCORE: 3.440462112426758  _id:  "573a1399f29313caabcedcb0"
   awards: Object
   cast: Array (4)
   countries: Array (1)
   directors: Array (1)
   fullplot: "Young Danny Madigan is a big fan of Jack Slater, a larger-than-life action hero. When Danny is given a magic ticket, he is transported into Slater's latest adventure."
   genres: Array (3)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-08-31 00:11:25.920000000"
   metacritic: 44
   num_mflix_comments: 0
   plot: "With the help of a magic ticket, a young film fan is transported into the fictional world of his favorite action movie character."
   poster: "https://m.media-amazon.com/images/M/MV5BNjdhOGY1OTktYWJkZC00OGY5LWJhY2…"
   rated: "PG-13"
   released: 1993-06-18T00:00:00.000+00:00
   runtime: 130
   title: "Last Action Hero"
   tomatoes: Object
   type: "movie"
   writers: Array (4)
   year: 1993
 SCORE: 3.440462112426758  _id:  "573a139af29313caabcf0e84"
   awards: Object
   cast: Array (4)
   countries: Array (1)
   directors: Array (1)
   fullplot: "Jan Schlichtmann, a tenacious lawyer, is addressed by a group of families whose children have died from leukemia. He takes on a case against two large companies accused of dumping toxic waste."
   genres: Array (1)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-09-07 00:21:34.950000000"
   metacritic: 68
   num_mflix_comments: 0
   plot: "The families of children who died sue two companies for dumping toxic waste."
   poster: "https://m.media-amazon.com/images/M/MV5BZmEzNjhiZWEtNTM5OS00ZmQyLThhYj…"
   rated: "PG-13"
   released: 1999-01-08T00:00:00.000+00:00
   runtime: 115
   title: "A Civil Action"
   tomatoes: Object
   type: "movie"
   writers: Array (2)
   year: 1998
 SCORE: 2.687220573425293  _id:  "573a1398f29313caabcead32"
   awards: Object
   cast: Array (4)
   countries: Array (1)
   directors: Array (1)
   fullplot: "Colonel James Braddock has a Vietnamese wife who was supposed to leave Vietnam with him, but was left behind. Years later, he returns to rescue her and their son."
   genres: Array (2)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-08-17 00:28:26.627000000"
   num_mflix_comments: 0
   plot: "Colonel James Braddock has a Vietnamese wife who was supposed to leave Vietnam with him, but was left behind."
   poster: "https://m.media-amazon.com/images/M/MV5BZTRjODU0MTUtMjBmMi00ZTBmLTk2MD…"
   rated: "R"
   released: 1988-03-02T00:00:00.000+00:00
   runtime: 101
   title: "Braddock: Missing in Action III"
   tomatoes: Object
   type: "movie"
   writers: Array (5)
   year: 1988
 SCORE: 2.687220573425293  _id:  "573a13a9f29313caabd1f502"
   awards: Object
   cast: Array (4)
   countries: Array (2)
   directors: Array (1)
   fullplot: "Bugs Bunny and Daffy Duck are up to their feuding ways again. Tired of all the attention going to Bugs, Daffy persuades the studio to let him go off on his own."
   genres: Array (3)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-08-28 00:30:31.080000000"
   metacritic: 64
   num_mflix_comments: 1
   plot: "The Looney Tunes search for a man's missing father and the mythical Blue Monkey diamond."
   poster: "https://m.media-amazon.com/images/M/MV5BMTkxNDk5MDQ2MF5BMl5BanBnXkFtZT…"
   rated: "PG"
   released: 2003-11-14T00:00:00.000+00:00
   runtime: 91
   title: "Looney Tunes: Back in Action"
   tomatoes: Object
   type: "movie"
   writers: Array (1)
   year: 2003
 SCORE: 2.4220800399780273  _id:  "573a13bdf29313caabd5acfe"
   awards: Object
   cast: Array (4)
   countries: Array (1)
   directors: Array (1)
   fullplot: "Fueled by the belief that another world is possible, acclaimed filmmaker Velcrow Ripper takes us on a journey through the spiritual activism movement."
   genres: Array (1)
   imdb: Object
   languages: Array (1)
   lastupdated: "2015-08-12 00:50:56"
   num_mflix_comments: 1
   plot: "Captures the exciting movement of Spiritual Activism that is exploding around the planet."
   poster: "https://m.media-amazon.com/images/M/MV5BMTI2MjM2Mzk2NV5BMl5BanBnXkFtZT…"
   released: 2008-10-01T00:00:00.000+00:00
   runtime: 90
   title: "Fierce Light: When Spirit Meets Action"
   tomatoes: Object
   type: "movie"
   writers: Array (1)
   year: 2008

db.movies.aggregate([
  {
    "$search": {
      "text": {
         "query": "action",
         "path": "title"
      }
    }
  },
  {
    "$limit": 2
  },
  {
    "$project": {
      "_id": 0,
      "title": 1
    }
  }
])

[
  {
    title: 'Action Jackson'
  },
  {
    title: 'Class Action'
  }
]

MongoDB Search がこれらのドキュメントを返すのは、クエリタームaction とドキュメントのトークン action が一致しているためです。MongoDB Searchはlucene.standardアナライザを使用して、titleフィールドのテキストに対して次の操作を実行して作成しました。

テキストを小文字に変換します。
単語の境界に基づいてテキストを分割し、個別のトークンを作成します。

次の表は、 MongoDB Search が Standard Analyzer を使用して作成するトークン（検索可能なターム）と、結果内のドキュメント用にMongoDB Search がキーワードアナライザおよび空白アナライザ用に作成するトークンを示しています。

タイトル	標準アナライザトークン	キーワードアナライザトークン	空白アナライザトークン
`Action Jackson`	`action`, `jackson`	`Action Jackson`	`Action`, `Jackson`
`Class Action`	`class`, `action`	`Class Action`	`Class`, `Action`

次のインデックスを使用してフィールドをインデックスする場合:

キーワードアナライザ、MongoDB Search では、クエリタームaction の結果内のドキュメントが一致しません。keywordアナライザは、検索タームがフィールドの内容全体に一致するドキュメントのみと一致するためです（Action Jackson と Class Action ）を含むドキュメントをインポートしようとすると、エラーを返します。
ホワイトスペースアナライザ、 MongoDB Searchaction whitespacetitleActionactionwhitespaceでは、クエリタームのアナライザ内のドキュメントがアナライザトークンと一致しない。

戻る

アナライザ

簡単

1	db.movies.createSearchIndex(
2	"default",
3	{
4	"mappings": {
5	"fields": {
6	"title": {
7	"type": "string",
8	"analyzer": "lucene.standard"
9	}
10	}
11	}
12	}
13	)