On this page
Build Indexes on Sharded Clusters
On this page
To minimize the impact of building an index on sharded clusters with replica set shards, use the following procedure to build indexes in a rolling fashion. For building an index on replica set deployments, see Build Indexes on Replica Sets instead.
The following procedure for sharded clusters deployments does take out one member of the shard replica set at a time. However, this procedure will only affect one member at a time rather than all secondaries at the same time.
Considerations
Unique Indexes
To create unique indexes using the following procedure, you must stop all writes to the collection during this procedure.
If you cannot stop all writes to the collection during this procedure, do not use the procedure on this page. Instead, build your unique index on the collection by issuing db.collection.createIndex()
on the mongos
for a sharded cluster.
Oplog Size
Ensure that your oplog is large enough to permit the indexing or re-indexing operation to complete without falling too far behind to catch up. See the oplog sizing documentation for additional information.
Prerequisites
- For building unique indexes
-
To create unique indexes using the following procedure, you must stop all writes to the collection during the index build. Otherwise, you may end up with inconsistent data across the replica set members. If you cannot stop all writes to the collection, do not use the following procedure to create unique indexes.
Warning
If you cannot stop all writes to the collection, do not use the following procedure to create unique indexes.
Procedure
A. Stop the Balancer
Connect a mongo
shell to a mongos
instance in the sharded cluster, and run sh.stopBalancer()
to disable the balancer:
sh.stopBalancer()
Note
If a migration is in progress, the system will complete the in-progress migration before stopping the balancer.
To verify that the balancer is disabled, run sh.getBalancerState()
, which returns false if the balancer is disabled:
sh.getBalancerState()
B. Determine the Distribution of the Collection
From the mongo
shell connected to the mongos
, refresh the cached routing table for that mongos
to avoid returning stale distribution information for the collection. Once refreshed, run db.collection.getShardDistribution()
for the collection you wish to build the index.
For example, if you want to an ascending index on the records
collection in the test
database:
db.adminCommand( { flushRouterConfig: "test.records" } );
db.records.getShardDistribution();
The method outputs the shard distribution. For example, consider a sharded cluster with 3 shards shardA
, shardB
, and shardC
and the db.collection.getShardDistribution()
returns the following:
Shard shardA at shardA/s1-mongo1.example.net:27018,s1-mongo2.example.net:27018,s1-mongo3.example.net:27018
data : 1KiB docs : 50 chunks : 1
estimated data per chunk : 1KiB
estimated docs per chunk : 50
Shard shardC at shardC/s3-mongo1.example.net:27018,s3-mongo2.example.net:27018,s3-mongo3.example.net:27018
data : 1KiB docs : 50 chunks : 1
estimated data per chunk : 1KiB
estimated docs per chunk : 50
Totals
data : 3KiB docs : 100 chunks : 2
Shard shardA contains 50% data, 50% docs in cluster, avg obj size on shard : 40B
Shard shardC contains 50% data, 50% docs in cluster, avg obj size on shard : 40B
From the output, you only build the indexes for test.records
on shardA
and shardC
.
C. Build Indexes on the Shards That Contain Collection Chunks
For each shard that contains chunks for the collection, follow the procedure to build the index on the shard.
C1. Stop One Secondary and Restart as a Standalone
For an affected shard, stop the mongod
process associated with one of its secondary. Restart after making the following configuration updates:
- Configuration File
- Command-line Options
If you are using a configuration file, make the following configuration updates:
- Change the
net.port
to a different port. [1] Make a note of the original port setting as a comment. - Comment out the
replication.replSetName
option. - Comment out the
sharding.clusterRole
option. - Set parameter
skipShardingConfigurationChecks
(also available for MongoDB 3.6.3+, 3.4.11+, 3.2.19+) totrue
in thesetParameter
section. - Set parameter
disableLogicalSessionCacheRefresh
totrue
in thesetParameter
section.
For example, for a shard replica set member, the updated configuration file will include content like the following example:
net:
bindIp: localhost,<hostname(s)|ip address(es)>
port: 27218
# port: 27018
#replication:
# replSetName: shardA
#sharding:
# clusterRole: shardsvr
setParameter:
skipShardingConfigurationChecks: true
disableLogicalSessionCacheRefresh: true
And restart:
mongod --config <path/To/ConfigFile>
Other settings (e.g. storage.dbPath
, etc.) remain the same.
If using command-line options, make the following configuration updates:
- Modify
--port
to a different port. [1] - Remove
--replSet
. - Remove
--shardsvr
if a shard member and--configsvr
if a config server member. - Set parameter
skipShardingConfigurationChecks
(also available for MongoDB 3.6.3+, 3.4.11+, 3.2.19+) totrue
in the--setParameter
option. - Set parameter
disableLogicalSessionCacheRefresh
totrue
in the--setParameter
option.
For example, restart your shard replica set member without the --replSet
and --shardsvr
options. Specify a new port number and set both the skipShardingConfigurationChecks
and disableLogicalSessionCacheRefresh
parameters to true:
mongod --port 27218 --setParameter skipShardingConfigurationChecks=true --setParameter disableLogicalSessionCacheRefresh=true
Other settings (e.g. --dbpath
, etc.) remain the same.
C2. Build the Index
Connect directly to the mongod
instance running as a standalone on the new port and create the new index for this instance.
For example, connect a mongo
shell to the instance, and use the db.collection.createIndex()
method to create an ascending index on the username
field of the records
collection:
db.records.createIndex( { username: 1 } )
C3. Restart the Program mongod
as a Replica Set Member
When the index build completes, shutdown the mongod
instance. Undo the configuration changes made when starting as a standalone to return to its original configuration and restart.
Important
Be sure to remove the skipShardingConfigurationChecks
parameter and disableLogicalSessionCacheRefresh
parameter.
For example, to restart your replica set shard member:
- Configuration File
- Command-line Options
If you a