[GitHub] [carbondata-site] chetandb commented on a change in pull request #69: 1.6.0

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

[GitHub] [carbondata-site] chetandb commented on a change in pull request #69: 1.6.0

chetandb commented on a change in pull request #69: 1.6.0
URL: https://github.com/apache/carbondata-site/pull/69#discussion_r320096113

 File path: src/main/webapp/configuration-parameters.html
 @@ -624,6 +648,11 @@ <h2>
 <td>Each CarbonData file has a companion CarbonIndex file which maintains the metadata about the data. These CarbonIndex files are read and loaded into driver and is used subsequently for pruning of data during queries. These CarbonIndex files are very small in size(few KB) and are many. Reading many small files from HDFS is not efficient and leads to slow IO performance. Hence these CarbonIndex files belonging to a segment can be combined into  a single file and read once there by increasing the IO throughput. This configuration enables to merge all the CarbonIndex files into a single MergeIndex file upon data loading completion.<strong>NOTE:</strong> Reading a single big file is more efficient in HDFS and IO throughput is very high. Due to this the time needed to load the index files into memory when query is received for the first time on that table is significantly reduced and there by significantly reduces the delay in serving the first query.</td>
+<td>To configure Ranges-based Compaction to be used or not for RANGE_COLUMN. If true after compaction also the data would be present in ranges.</td>
 Review comment:
   Change ranges to range in "To configure Ranges-based Compaction "

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[hidden email]

With regards,
Apache Git Services