So for the last few months I’ve been doing a year long course on “Big Data”, which is just data science and data analytics . The skills we are learn are Python, R,SQL and MS Excel. We’ll be doing the flowing certs MS Excel Expert, MTA in Python and an Oracle SQL one . For me the Python and SQL are the most important.
I’ve played with Python in the past ,and I don’t have a in depth knowledge, which kind of worried me a bit , but after watching some youtube videos , I realize I don’t need to master Python , I just need to know enough Python to get the job done the rest I can learn.What I need to know is pandas, matplotlib, and numpy ( and maybe nltk ), also Jupyter note book is handy too . I have also found I have natural ability for SQL and that the programming involved in data science suites me, I was never going to to be a software developer or engineer , I was always someone who wrote scripts on the fly to get the job done.
I am a Harry Potter fan , and found someone did Natural Language Processing on the books and have a txt file of all the books on their GitHub, so that what I am going to do, analyse the books. .Second on Kaggel I found data set on Superheros , which looks fun, so also will analyses that.I will post the results here and on my GitHub.