
Data Science - Data Loading Techniques in Hadoop

Data science is the study of the generalizable extraction of knowledge from data, yet the key word is science. This tutorial gives a brief introduction to Data Science.
The topics covered in the video:

1. Data Loading Techniques and Data Analysis
3. Using PIG
4. Using HIVE
5. Data Loading Using Flume
6. Data Loading Using Sqoop
7. Map Reduce Process
8. The Overall Map Reduce Word Count Process

Edureka is a New Age e-learning platform that provides instructor-led, live online classes for learners who prefer a hassle-free, self-paced learning environment that is accessible from any part of the world.
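For readers who have not yet watched the video, the word count process named in the topic list can be sketched in a few lines of plain Python. This is only an illustration of the map, shuffle, and reduce steps, not the code used in the tutorial and not Hadoop's API; the function names and the sample input are assumptions made for this example.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group the emitted counts by word."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped):
    """Reduce: sum the grouped counts for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run the three phases in order on an iterable of text lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    # Hypothetical sample input, chosen only for illustration.
    sample = ["big data is big", "data science uses data"]
    print(word_count(sample))
```

In a real Hadoop job the map and reduce steps run in parallel across many machines and the framework performs the shuffle; here all three steps run in a single process so that the data flow is easy to follow.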

Data Science Tutorials for Beginners

Data Science is all about extracting knowledge from data. Data Science is the integration of methods from mathematics, probability models, machine learning, computer programming, statistics, data engineering, pattern recognition and learning, visualization, uncertainty modelling, data warehousing, and high performance computing with the goal of extracting meaning from data and creating data products. This interdisciplinary and cross-functional field leads to decisions that move an organization forward, such as decisions about a proposed investment, a product, or a business strategy.

Data Science is a buzzword, often used interchangeably with analytics or big data. At times analytics is synonymous with Data Science; at other times it means something else. A Data Scientist who uses raw data to build a predictive behaviour model falls into the category of analytics.

Data science is a steadily growing discipline that is driving significant changes across industries and in companies of every size. It is emerging as a critical source of insights for enterprises dealing with massive amounts of data.

Data Science is a field of study that involves extracting meaningful insights from data. It is a steadily growing discipline that is bringing change to industries and companies across the world. Watch the video, which explains various concepts related to the discipline in detail and emphasizes Data Science in combination with the programming language R.
The following topics are covered in the tutorial:
1. What is R?
2. Data Analysis Process
3. Why use R?
4. R: Functional Advantages
5. R Programming Concepts
6. R: Data Import Techniques
7. Processing the Data
8. Plotting Functions in R
9. Data Sub-setting: Indexing
10. Control Structures in R
11. Functions in R

Planet Big Data Feeds

Connected Enterprise Webinar – New World of Hyperconnectivity, Big Data & Real-Time Analytics

Watch Anthony Thomas, CIO at Vodafone India, speak on the New World of Hyperconnectivity, Big Data and Real-Time Analytics. This is the first part of a series of webinars, brought to you by IDG.

Vodafone India’s official YouTube channel. Watch all the award-winning Vodafone commercials, videos shot on different occasions, and videos created for Facebook right here. Visit:

Source: Vodafone India

Big Data and Data Science Events

Click here: Big Data Events

Click here: Big Data Conferences

Click here: Big Data Tech Conference, Cambridge, MA

Click here: Big Data Summits in USA.

ZENG Ming of Alibaba: Big Data is the Future of the Internet

ZENG Ming, Chief Strategy Officer at Alibaba Group, discussed the importance of intelligently analyzing Big Data to generate value, especially with unstructured data where the questions asked will change over time. Zeng spoke on the panel “Big Data: A New Frontier” with Alex Cheng, Vice President at Baidu, which was moderated by Matt Roberts, the Managing Director of USITO, at the China 2.0 Forum in Beijing held at Stanford Center at Peking University (SCPKU) on April 12, 2013.

Learn more about the China 2.0 Forum in Beijing:…

China 2.0 is an initiative of the Stanford Graduate School of Business focusing on innovation and entrepreneurship in China. Learn more:

Source: Stanford Graduate School of Business

Pivotal One and Agile Data Science

Kaushik Das discusses the success of Pivotal One, Pivotal’s new PaaS, and their integration of agile development with data science.

Source: OreillyMedia


Hans Rosling: Let my dataset change your mindset

Talking at the US State Department this summer, Hans Rosling uses his fascinating data-bubble software to burst myths about the developing world. Look for new analysis on China and the post-bailout world, mixed with classic data shows.

TEDTalks is a daily video podcast of the best talks and performances from the TED Conference, where the world’s leading thinkers and doers give the talk of their lives in 18 minutes. Featured speakers have included Al Gore on climate change, Philippe Starck on design, Jill Bolte Taylor on observing her own stroke, Nicholas Negroponte on One Laptop per Child, Jane Goodall on chimpanzees, Bill Gates on malaria and mosquitoes, Pattie Maes on the “Sixth Sense” wearable tech, and “Lost” producer JJ Abrams on the allure of mystery. TED stands for Technology, Entertainment, Design, and TEDTalks cover these topics as well as science, business, development and the arts. Closed captions and translated subtitles are now available in a variety of languages.

Source: TED Talks