Questions regarding using Lance for OLAP workloads #3252
-
Hello everyone. Recently I had the opportunity to consider Lance as a data format for building data lake for both ML and OLAP workloads. I loved that Lance has much better support for both quick random access and huge column compared to parquet! Through the experiment I have some question regarding using Lance for OLAP workloads and would love to hear about them 🙏
I'm looking forward to the answer and thank you all for your work! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Yes. We've previously considered partitioning or clustering, but likely won't implement that. For fast filter performance, secondary indices are generally better.
I think having a proper table provider is probably the best future. We've started some of that work recently, but I think there's more work to do for improved pushdown. |
Beta Was this translation helpful? Give feedback.
Yes. We've previously considered partitioning or clustering, but likely won't implement that. For fast filter performance, secondary indices are generally better.
I think having a proper table provider is probably the best future. We've started some of that work recently, but I think there's more work to do for improved pushdown.