Hacker Newsnew | past | comments | ask | show | jobs | submit | jdwittenauer's commentslogin

Agree that Impala would fit well on this list. They didn't have any training on it, presumably because it's a Cloudera-led technology, but my understanding is it's very popular. Not sure that it truly replaces Hive/Tez though. I think they each excel at certain types of workloads.


I think the term "Hadoop" is becoming almost meaningless. It seems to now be more of a pointer referencing a basket of distributed processing technologies that run on YARN/HDFS. Agree completely with having multiple technologies to solve every problem, that's one of the most confusing parts to learn.

My own perspective is that there are lots of businesses that haven't yet needed the capabilities provided by a platform like Hadoop, but they likely will in the future. So the market may be saturated based on current needs but that market will continue to expand. Whether it's Hadoop (YARN/HDFS/etc.) that wins that market share or some other stack like Spark/Mesos remains to be seen.


> It seems to now be more of a pointer referencing a basket of distributed processing technologies that run on YARN/HDFS

You reference the MapR distribution for their training material, and its interesting that their version of HDFS is a reimplentation in C++ (MapR-FS). Its part of the reason I settled on MapR to use tools like Apache Drill, because the filesystem becomes usable to non-Hadoop tools via NFS (i.e. Awk).

Given a shift in some categories away from map-reduce to other approaches, could Hadoop eventually just become a collection of distributed filesystems and job schedulers?


None of the material in these posts could be used directly to complete assignments for the class. I suppose someone could attempt to "back-port" some of the Python code to Octave, but if you're going to that much trouble it's probably easier to just solve it in Octave in the first place.


I took the course a while back and most of the assignments were just straight copying pasting from the pdf or translating some math formula into octave and never more than 10 lines of code. It's so spoon-fed already I don't see why anyone would want to cheat by porting it from python.


Having taken the myself, I couldn't agree more.

The class is meant to introduce one to machine learning. As such, the problems are usually fairly simple and one wouldn't need to cheat unless all one is attempting to do it to solve those without looking at either the leture videos or slides.

(Translating from Python to Octave might, on the other hand, require more effort in comparison to implementing the solutions in Octave.)


I know I'm just burning karma here, but this really triggers me.

Andrew Ng gives you a free introductory course in one of the hottest topics in computing, and in exchange asks you not to do one thing. And you do that one thing. I have my solutions in octave, and it would be really convenient for me to back them up on github, but I keep them on a usb stick for this very reason. I am respecting the man's wishes who was so kind as to teach me about machine learning.

You should take them down if you don't have the explicit permission to share them as the honor code states. You don't have explicit permission, do you?


Since you linked to the honor code, can you point to anything in these blog posts that qualifies - in the context of completing the class - as "solutions to homework, quizzes, exams, projects, and other assignments"? Could anything I provided be directly used to "cheat" and finish the class without doing the work?

And just as a practical matter, there are dozens of github repos with literal (as in copy, paste, submit, done) solutions to these problems already available.

You're certainly free to disagree, but I do not view this as violating either the spirit or the intent of the honor code. This content has been out for years and is not in any way novel or unique. It's simply another vector through which the material can be learned, possibly opening it up to an even wider audience. Which is, I believe, what Andrew's goal was all along.


[flagged]


I don't see much value in continuing to debate this. You're entitled to your opinion and I respect that, but I do not share your viewpoint.


> The way I see it, Andrew Ng is an entrepreneur co-founder with a startup called Coursera. He makes his courses free, because he's a good guy, and free attracts the audience that makes his platform worth something.

I liked this class and Andrew seems like a great guy but I'd like to point out that it's no longer free to take the evaluations for Coursera courses. Coursera is a startup and needs to find a way to monetize the courses to stay in business. I have no problem with that but I think its a little disingenuous to present Coursera as a free service when its clearly not.


I see this written on HN and reddit a lot lately, yet I've just signed up and begun two courses, included Andrews ML course and have yet to be asked for money or met restrictions beyond the once in place last year.


I'm doing the ML course at the moment and I'm asked to pay for a certificate after every screen.


Yes, you are right. I should have written "forced to" instead of asked to, but now it is too late to edit.

Anyway, my point stands. I can take these courses for free, despite people on HN and reddit claiming you can't.


These are direct solutions to the exercises. I took the class some time ago and also did it in python.


As far as I can remember you can't submit assignments in Python, can you? Or maybe you did it in Python first, and then ported in Octave before submitting? If so, how did it work out for you? At first I thought I wanted to go down the same path (because I'm comfortable with Python but not with Octave) but then concluded it was too much trouble backporting everything.


As far as I can remember you can't submit assignments in Python, can you?

When I took the class earlier this year, the answer was - effectively - "no". I mean, yeah, you could do some trickery with calling Python from Octave using whatever FFI Octave has, or you could possibly reverse engineer the protocol they use to talk from your code to the upstream server... but anybody doing all that would be doing more work that just completing the assignments in Octave to begin with.


There exist implementations of an interface between Python and the Coursera grading server.

https://github.com/mstampfer/Coursera-Stanford-ML-Python is an example.


Of course somebody would have reverse engineered the protocol already. Oh well. I still think most people would find it easier to just do the assignments themselves than deal with all this, but I'll grant that there's "always one in every crowd" as they say.


We have Pytave. You don't need FFI anymore. Or rather, we wrote the FFI for you. Write your code in Scipy and Octave sees Octave-shaped objects in return.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: