Hive is a data warehousing tool for querying large datasets in Hadoop. It allows SQL-like queries using HiveQL. Hive uses a metastore to store metadata about tables and partitions. Data can be loaded into tables from files on HDFS or other data stores. Common commands include CREATE, INSERT, SELECT, JOIN, and ALTER. Partitioning and bucketing can improve query performance. Views and indexes can also be created for optimization.