Data Structure
Networking
RDBMS
Operating System
Java
MS Excel
iOS
HTML
CSS
Android
Python
C Programming
C++
C#
MongoDB
MySQL
Javascript
PHP
- Selected Reading
- UPSC IAS Exams Notes
- Developer's Best Practices
- Questions and Answers
- Effective Resume Writing
- HR Interview Questions
- Computer Glossary
- Who is Who
Database Articles - Page 21 of 618
5K+ Views
Bucketing is a method in Hive which is used for organizing the data. It is a concept of separating data into ranges known as buckets. Bucketing in hives comes helpful when the use of partitioning becomes hard. A user can determine the range of a specific bucket by the hash value. Partitioned tables can be bucketed to separate the data further to perform queries more efficiently. Every bucket is stored as a file within the table or the partition’s directories on HDFS. The records having a similar value within a column are always stored in the same bucket. Bucketing can ... Read More
891 Views
The full name of the RDD is a distributed database. Spark performance is based on this ambiguous set, enabling it to consistently cope with major data processing conditions, including MapReduce, streaming, SQL, machine learning, graphs, etc. Spark supports many programming languages, including Scala, Python, and R. RDD also supports the maintenance of material in these languages. How to create RDD Spark supports RDDS architecture in many areas, including local file systems, HDFS file systems, memory, and HBase. For the local file system, we can create RDD through the following way − val distFile = sc.textFile("file:///user/root/rddData.txt") By default, Spark takes ... Read More
362 Views
Data was previously stored in relational data management systems when Hadoop and big data concepts were not available. After introducing Big Data concepts, it was essential to store the data more concisely and efficiently. However all data stored in the related data management system needs to be transferred to the Hadoop archive. With Sqoop, we can transfer this amount of personal data. Sqoop transfers data from a related database management system to a Hadoop server. Thus, it facilitates the transfer of large volumes of data from one source to another. Here are the basic features of Sqoop − Sqoop ... Read More
578 Views
Humans go through different stages in their personal and professional lives during which they undergo situations or experiences after which privacy and information security becomes a vital requirement for them. For example, a person approaching someone from the medical profession would much appreciate his medical data remain protected from others so that he/she doesn’t face discrimination at home or work. Similarly, a company tasked with conducting surveys and running research activities is obliged to maintain strong anonymity and confidentiality measures in regard to their survey pool. In some sectors-- such as medicine, religion, and politics, to mention a ... Read More
510 Views
The act of copying data from one area to another, typically from one hard drive to another, is what we normally call as "backing up." This can be done manually or automatically. Synchronization is the process of ensuring that two different sites have the same collection of files by manually or automatically copying or deleting them as required. Syncing is a faster option, but it comes with a risk. Read through this article to find out more about Sync and Backup and how they are different from each other. What is Sync? Synchronization (sync) is the process of ensuring that ... Read More
3K+ Views
Both the Internet of Things (IoT) and Big Data are currently the trending topics that are frequently discussed in the context of the information technology industry. It is practically impossible to discuss one of these topics without also bringing up the other. Both are the wave of the future when it comes to data, and by data, we mean enormous amounts of data. We are now living in a digital age in which new things are constantly being linked to the Internet in an effort to make people's lives easier.Read through this article to get an overview of IoT and ... Read More
5K+ Views
The storing and processing of data in database management systems is one of the most significant functions that business organisations perform with their data. Users of a database management system can access their data in the way they like, because the system hides the storage details. Users are provided an abstraction of the data through the usage of database models.When it comes to building databases, the two most popular types of database models are hierarchical and relational.What is a Hierarchical Database?IBM's first attempt at developing a data model in traditional database systems, which took place in 1968, resulted in the ... Read More
15K+ Views
Sorting is the process of putting the data in a logical order so that it can be analysed in the most efficient manner possible. Searching is the action of looking for a certain record within a database. If the data are correctly organised in a predetermined manner, then the process of searching is going to be simple and time-effective. The topic of this article is trees, which are one of the most significant examples of non-linear data structures.The primary purpose of using trees to represent data is to illustrate a hierarchical link between the various components of the structure being ... Read More
1K+ Views
A large structure and a large number of people are both components of an organisation. It quickly becomes impossible to handle all of the data on an individual basis. People in this day and age typically make use of electronic systems for accomplishing this task. Private clouds and data centers are two examples of the kinds of facilities that are used by virtually all large businesses and organizations to carry out a variety of operations involving their data.What is a Private Cloud?It is well known that private clouds offer a higher level of security than public clouds since they provide ... Read More
310 Views
Let us understand the concepts of HBase and Cassandra before learning the differences between them.CassandraCassandra has a different infrastructure. Cassandra uses different DBMS along with their infrastructure. When Cassandra uses different DBMS then time complexity will increase.Cassandra supports ordered partitioning. This can lead to row size up to 10 megabytes.In Cassandra, we use seed nodes. These nodes perform inter-cluster communication. Here, we use internal communication. Casandra has lightweight transactions.Cassandra is based on the Jbury shell. But it has a specific Query language. That is CQL, it is modelled after SQL. It is better than HBase in Documentation. It uses the ... Read More