样本 Mflix 数据集
在此页面上
sample_mflix
数据库包含有关电影和电影院的数据。数据库还包含某些元数据的集合,包括用户和对特定电影的评论。
要了解如何将 Atlas 提供的样本数据加载到集群中,请参阅加载样本数据。
集合
sample_mflix
数据库包含以下集合:
集合名称 | 说明 |
---|---|
包含与特定电影相关的评论。 | |
包含 sample_mflix.movies 集合中 Western 、Action 和 Fantasy 类电影的相关信息,并添加了 plot_embedding 字段,该字段使用 OpenAI 的 text-embedding-ada-002 嵌入模型创建,您可以使用该模型通过 Atlas Vector Search 进行查询。 | |
包含电影信息,其中包括发行年份、导演和评价。 | |
元数据字段。包含用户的 JSON web token。 | |
包含电影院的位置。 | |
包含用户信息。 |
sample_mflix.comments
此集合包含与特定电影相关的评论。每个文档都包含评论文本、提交评论的用户以及评论相应的电影。
索引
该集合包含以下索引:
名称 | Index | 说明 |
---|---|---|
_id_ | { "_id": 1 } | _id 字段的主键索引。 |
文档样本
{ "_id": { "$oid": "5a9427648b0beebeb69579cc" }, "name": "Andrea Le", "email": "andrea_le@fakegmail.com", "movie_id": { "$oid": "573a1390f29313caabcd418c" }, "text": "Rem officiis eaque repellendus amet eos doloribus. Porro dolor voluptatum voluptates neque culpa molestias. Voluptate unde nulla temporibus ullam.", "date": { "$date": { "$numberLong": "1332804016000" } } }
sample_mflix.embedded_movies
此集合包含有关 Western
、Action
或 Fantasy
genres
的电影的详细信息。每个文档都包含一部电影以及电影名称、上映年份和演员阵容等信息。
此外,此集合的文档还包含一个 plot_embedding
字段,其中包含使用 OpenAI 的 text-embedding-ada-002
嵌入模型创建的嵌入,您可以将其与 Atlas Vector Search 功能结合使用。
索引
该集合包含以下索引:
名称 | Index | 说明 |
---|---|---|
_id_ | { "_id": 1 } | _id 字段的主键索引。 |
文档样本
注意
为了便于阅读,示例中的维数已被截断。
{ "_id": { "$oid": "573a1396f29313caabce582d" }, "plot": "A young swordsman comes to Paris and faces villains, romance, adventure and intrigue with three Musketeer friends.", "genres": ["Action", "Adventure", "Comedy"], "runtime": { "$numberInt": "106" }, "rated": "PG", "cast": ["Oliver Reed", "Raquel Welch", "Richard Chamberlain", "Michael York"], "num_mflix_comments": { "$numberInt": "0" }, "poster": "https://m.media-amazon.com/images/M/MV5BODQwNmI0MDctYzA5Yy00NmJkLWIxNGMtYzgyMDBjMTU0N2IyXkEyXkFqcGdeQXVyMjI4MjA5MzA@._V1_SY1000_SX677_AL_.jpg", "title": "The Three Musketeers", "lastupdated": "2015-09-16 06:21:07.210000000", "languages": ["English"], "released": { "$date": { "$numberLong": "133747200000" } }, "directors": ["Richard Lester"], "writers": ["George MacDonald Fraser (screenplay)", "Alexandre Dumas père (novel)"], "awards": { "wins": { "$numberInt": "4" }, "nominations": { "$numberInt": "7" }, "text": "Won 1 Golden Globe. Another 3 wins & 7 nominations." }, "year": { "$numberInt": "1973" }, "imdb": { "rating": { "$numberDouble": "7.3" }, "votes": { "$numberInt": "11502" }, "id": { "$numberInt": "72281" } }, "countries": ["Spain", "USA", "Panama", "UK"], "type": "movie", "tomatoes": { "viewer": { "rating": { "$numberDouble": "3.5" }, "numReviews": { "$numberInt": "9600" }, "meter": { "$numberInt": "78" } }, "dvd": { "$date": { "$numberLong": "982022400000" } }, "critic": { "rating": { "$numberDouble": "7.1" }, "numReviews": { "$numberInt": "11" }, "meter": { "$numberInt": "82" } }, "lastUpdated": { "$date": { "$numberLong": "1441307415000" } }, "rotten": { "$numberInt": "2" }, "production": "Live Home Video", "fresh": { "$numberInt": "9" } }, "plot_embedding": [ -0.004237316, -0.022958077, -0.005921211, -0.020323543, 0.010051459 ] }
sample_mflix.movies
此集合包含有关电影的详细信息。每个文档都包含一部电影以及电影名称、上映年份和演员阵容等信息。
索引
该集合包含以下索引:
文档样本
{ "_id": { "$oid": "573a1390f29313caabcd413b" }, "title": "The Arrival of a Train", "year": { "$numberInt": "1896" }, "runtime": { "$numberInt": "1" }, "released": { "$date": { "$numberLong": "-2335219200000" } }, "poster": "http://ia.media-imdb.com/images/M/MV5BMjEyNDk5MDYzOV5BMl5BanBnXkFtZTgwNjIxMTEwMzE@._V1_SX300.jpg", "plot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, ...", "fullplot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, the line dissolves. The doors of the railway-cars open, and people on the platform help passengers to get off.", "lastupdated": "2015-08-15 00:02:53.443000000", "type": "movie", "directors": [ "Auguste Lumière", "Louis Lumière" ], "imdb": { "rating": { "$numberDouble": "7.3" }, "votes": { "$numberInt": "5043" }, "id": { "$numberInt": "12" } }, "cast": [ "Madeleine Koehler" ], "countries": [ "France" ], "genres": [ "Documentary", "Short" ], "tomatoes": { "viewer": { "rating": { "$numberDouble": "3.7" }, "numReviews": { "$numberInt": "59" } }, "lastUpdated": { "$date": { "$numberLong": "1441993589000" } } }, "num_mflix_comments": { "$numberInt": "1" } }
sample_mflix.sessions
该集合包含有关 用户 的元数据。每个文档都包含一个用户及其相应的 JSON Web 令牌
索引
该集合包含以下索引:
名称 | Index | 说明 | 属性 |
---|---|---|---|
_id_ | { "_id": 1 } | _id 字段的主键索引。 | |
user_id_1 | { "user_id" : 1} | user_id 字段的升序索引。 |
文档样本
{ "_id": { "$oid": "5a98348755593fdf68350932" }, "user_id": "bfb9vc1zz@xhasq.5h9", "jwt": "eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9..." }
sample_mflix.theaters
此集合包含电影院位置。每个文档都包含一个电影院及其位置,采用字符串和GeoJSON形式。
索引
文档样本
{ "_id": { "$oid": "59a47286cfa9a3a73e51e72c" }, "theaterId": { "$numberInt": "1000" }, "location": { "address": { "street1": "340 W Market", "city": "Bloomington", "state": "MN", "zipcode": "55425" }, "geo": { "type": "Point", "coordinates": [ { "$numberDouble": "-93.24565" }, { "$numberDouble": "44.85466" } ] } } }
sample_mflix.users
该集合包含 mflix
个用户的信息。每个文件都包含一个用户及其姓名、电子邮件和密码。
索引
名称 | Index | 说明 | 属性 |
---|---|---|---|
_id_ | { "_id": 1 } | _id 字段的主键索引。 | |
email_1 | { "email: 1 } | email 字段上的唯一升序索引。 |
文档样本
{ "_id": { "$oid": "59b99db4cfa9a34dcd7885b6" }, "name": "Ned Stark", "email": "sean_bean@gameofthron.es", "password": "$2b$12$UREFwsRUoyF0CRqGNK0LzO0HM/jLhgUCNNIJ9RJAqMUQ74crlJ1Vu" }