Just wondering can I use a string as a dimension?
I’m not sure who invented the gene naming scheme, they look like numbers but the are too sparse (from 10K to 200Billion), and very uneven. So chunking becomes really hard: it either runs out of memory because 1-2 huge chunks, or runs very slow because there are too many chunks.
So if SciDB internally build a map of string->int64 and we can use String as a dimension, that will be great.
And follow up question, if scidb allows a string dimension, does it support joins on string dimensions?
Thanks a lot!