Short term courses on data science topics...

Short term courses on data science topics...

Data science is now a buzzword with more hype than sense. Anyone who does anything with data becomes a data scientist, just like calling any website as being in a cloud. Other relatives of this buzzword are big data, analytics, etc and there are second level relatives like text analytics, recommender systems, predictive analytics, and the like. Most big data courses is about Hadoop, with a bit of map reduce. Occasionally a bit of R, which by the way, is not a bad idea. Often the "science" and the "analytics" of the data goes to a second or third place, behind these popular jargon tools.

Data science has different aspects: acquisition, storage, processing, etc in terms of the different stages. Orthogonal to this is the structure of the data -- text, images, video, numbers, records, etc and its semantics. This defines the storage standards, and the basic access primitives. The volume of the data is a major, but not the only major, component. The other 'v's like velocity, veracity, etc are also important in devising efficient algorithms for acquisition, storage and processing. When it comes to processing, there is an ocean ahead. The purpose of processing is a key driver -- prediction, modeling, trend analysis, outlier analysis, clustering, and so on are examples. Each offers a multitude of algorithms depending on the many factors mentioned above. When the data is not adequately clean or complete, there is another aspect coming into play -- data preprocessing, which includes data transformation as well.

Thus looks the space of data science, and it is not to be reduced to hadoop, R, or any of the few popular tools. Tools are, just tools -- a distinction that is often lost, in our obsession with tools. Quite like a fresh CS graduate saying "Java" or ".NET" is his favourite topic in computer science!

CDAC Mumbai is putting together a few courses in the broad space of data science. They cover little islands in the space outlined earlier. Connect these islands to the broad space, and you can use them effectively.

The courses cover R (an open source and excellent tool with great support for data representation, statistical processing, and visualisation), predictive analytics (one of the popular interest area in the 'processing' segment), and text analytics (a high potential area, but with many hurdles). We will use mostly open source tools in the course, so that you can go back and practice them on your own. And many of these tools are as good (and more extendable!) as their commercial counterparts, for most requirements.

Please check the site kbcs.in/datascience for more details, registration information, etc. There will be more courses coming up later, looking at some of the other areas.

Vivek Kumar

Senior Backend Engineer | Retail & Connected Cars Platform Developer

8 年

Sir, what's the difficulty level for the course and will it cover significant sections in this short amount of time ? (please answer specifically to predictive analytics course) > Can you please name the other two courses(as mentioned in blog) so that I can finalize my registration ?

Renato Pires dos Santos

Founder & Lead Developer at academicum.ai | Building AI-Powered Tools to Enhance Academic Research and Learning

8 年

I don't know about this CDAC Mumbai course but I'm quite happy with Coursera Data Science Specialization, as it includes R Programming, Cleaning Data, Exploratory Data Analysis, Statistical Inference, Regression Models, and Machine Learning, closing with a capstone project on developing a data science product.

Shridhar Bhat

Founder and CEO at Sarvaha Systems | Trusted Software Development Partner

8 年

Great idea, M Sasikumar! How would these compare to similar courses from Coursera (especially Andrew Ng and Manning)?

回复

要查看或添加评论,请登录

M Sasikumar的更多文章

  • Switching to online teaching

    Switching to online teaching

    Thanks to the sudden onslaught of Corona and the consequent lockdown running into many weeks, academic systems have…

    6 条评论
  • Expert Systems Nostalgia

    Expert Systems Nostalgia

    Ed Feigenbaum, Joshua Lederberg, Bruce Buchanan, William Clancy, Randall Davis, ..

    3 条评论
  • The “new” AI – or is it?

    The “new” AI – or is it?

    The air these days is full of various forms of the buzzword: AI. Almost every company is investing in it.

    31 条评论
  • Introducing G-compris: A gift for your child

    Introducing G-compris: A gift for your child

    Any app-space or software repository today, is full of games of various shades and types, mostly aimed at children…

    4 条评论
  • Manage your programming labs using Parikshak

    Manage your programming labs using Parikshak

    Numerous studies indicate that the practical skills in a typical computer science or IT graduate is quite poor. Many…

    18 条评论
  • Being a teacher....

    Being a teacher....

    Yesterday morning, in one of my whatssap groups, I got a forward-message evangelising the greatness of a teacher. Many…

    14 条评论
  • Writing a Review

    Writing a Review

    Much has been and is still being written about the deteriorating quality of research publications, and perhaps research…

    6 条评论
  • RSIC -- an amazing initiative at IIT Madras

    RSIC -- an amazing initiative at IIT Madras

    A casual meeting last week with Prof Mangala Sunder, of IIT Madras and closely associated with the NPTEL movement in…

    6 条评论
  • Olabs -- online labs for schools

    Olabs -- online labs for schools

    It was a few years back that we (CDAC Mumbai and Amrita Univ) got together with this idea of building virtual labs for…

    7 条评论
  • Reflections on being a teacher…

    Reflections on being a teacher…

    Last week, I was attending the LaTiCE conference -- a good international conference in the area of learning and…

社区洞察

其他会员也浏览了