-
Essay / The external and internal characteristics of Big Data
Big data resembles a flow of data. The abundance of data is expanding day by day. Big Data focuses on the huge amount of data. Data can be in structured, unstructured and semi-structured form. Structured data consists of text files that can be displayed in rows and columns. It can be easily treated. Unstructured data is the opposite of structured data. Data cannot be displayed in a relational database. The example of unstructured data can be word processing document, presentation, audio, video, email and also many other business documents. The third category concerns semi-structured data in which XML, JSON and NoSQL databases appear. Say no to plagiarism. Get a tailor-made essay on “Why Violent Video Games Should Not Be Banned”? Get an original essayThe term big data is strongly related to unstructured data. We can say that 80% of big data is unstructured. In reality, Big Data refers to data that is not managed by traditional databases. Traditional database system stores the data in gigabytes, while in big data it stores the data in petabytes, exabytes, zettabytes, etc. Companies need to retain or hire highly experienced personnel for in-depth analytical insight into Big Data. The era of Big Data continues to grow on popular social sites like Facebook and Twitter. The understanding of Big Data will differ in business, technological and industrial terms. McKinsey challenged the following five units in which data is growing rapidly. These are healthcare, public sector, retail, manufacturing and personal location data. The main advantage of Big Data is that it provides scalability and data analysis. Examples of Big Data are found in real-world scenarios such as banking, social media, web data, and any type of daily transactions. Complete definition of Big Data with these five Vs: volume, variety, velocity, veracity, value. Here are the 5 Vs of Big Data, explained in simple language. Volume: In terms of Big Data, the word “big” defines volume. in the future, data will be expressed in zettabytes. From social networking sites, a large amount of data is shared. Here are some interesting statistics that show the volume of data. According to 1 second live internet statistics, there are: 64,551 Google searches 7,886 tweets on Twitter 822 Instagram photos uploaded in 1 second 72,179 YouTube videos viewed in 1 second 2,655,007 emails sent in 1 second, including including spam 52,180 GB of internet traffic in 1 second 2.5 million pieces of content shared by Facebook users 571 websites created every minute of the day Variety: As I have discussed types of structured, semi-structured data and unstructured. These types of data are difficult to manage by a traditional database system. Different types of data are called manifold. Nowadays, a lot of structured data is generated. Speed: The speed at which data is created, called velocity. Some examples of data generated by social media sites are tweets on Twitter, statuses/comments/shares on Facebook and many others. Data is generated in real-time, near real-time, hourly, daily, weekly, monthly and yearly, in batches, etc. Veracity: data conformity. Veracity attributes include data accuracy, integrity, and authenticity. This leads to.