Relative Content

Tag Archive for hive

Group by when condition met in Hive

I’m trying to recreate an excel pivot table in hive by selecting two variables based on a condition and then sum over the grouping variable.

Hive Metastore behind a load balancer

We have a centralized RDBMS that backs the metastore, is it possible to set up multiple metastore instances behind a load balancer (all accessing the same RDBMS server)? Are the metastore instances stateful or need to know about other instances to avoid race conditions?

Does Hive supports information schema?

I want to query all tables/columns metadata in hive. The way this is done in other databases like mssql, postgresql is to query information_schema views, but i don’t see anything like that in hive.
Can someone confirm is information_schema doesn’t exists in hive. If it doesn’t exists what would be a way to query metadata ??

Issue with hive.auto.convert.join Setting in Hive 4.0.0

I encountered an issue while using the latest version of Hive, 4.0.0. I am not sure if this is a bug. The details are as follows: I am using Hadoop version 3.3.6 and have set up a fully distributed cluster. The Hive version is 4.0.0, with one master server and one client. Currently, I am connecting to Hive via the Beeline client. When executing a join in Hive, I encountered the following error: