Skip to content

Questions regarding using Lance for OLAP workloads #3252

Answered by wjones127
manhld0206 asked this question in Q&A
Discussion options

You must be logged in to vote

Lance doesn't have any notion of partitioning or clustering (for example hive partitioning or data layout optimization like Z-ordering in Delta Lake). I assume that we can achieve the same optimization with secondary indexes. Is my assumption correct?

Yes. We've previously considered partitioning or clustering, but likely won't implement that. For fast filter performance, secondary indices are generally better.

I wonder what is the the better way in the future for querying lance data?

I think having a proper table provider is probably the best future. We've started some of that work recently, but I think there's more work to do for improved pushdown.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by manhld0206
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants