Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.
N/A
DataRobot
Score 8.9 out of 10
N/A
The DataRobot AI Platform is presented as a solution that accelerates and democratizes data science by automating the end-to-end journey from data to value and allows users to deploy AI applications at scale. DataRobot provides a centrally governed platform that gives users AI to drive business outcomes, that is available on the user's cloud platform-of-choice, on-premise, or as a fully-managed service. The solutions include tools providing data preparation enabling users to explore and…
Organizations that have already implemented the on-premise, Hadoop-based Cloudera Data Platform (CDH) for their Big Data warehouse architecture will definitely get more value from the seamless integration of Cloudera Data Science Workbench (CDSW) with their existing CDH platform. However, for organizations with a hybrid (cloud and on-premise) data platform and no prior implementation of CDH, implementing CDSW can be a challenge both technically and financially.
DataRobot can be used for risk assessment, such as predicting the likelihood of loan default. It can handle both classification and regression tasks effectively. It relies on historical data for model training. If you have limited historical data or the data quality is poor, it may not be the best choice as it requires a sufficient amount of high-quality data for accurate model building.
DataRobot uses algorithms to analyze and evaluate numerous machine-learning techniques in order to provide models that assist in company-wide decision making.
Our DataRobot program puts the strength of automated machine learning on an "even playing field" and allows us to make decisions in an extremely timely manner. The speed is consistent without being offset by errors or false negatives.
It encompasses many desired techniques that help companies in general reconfigure into artificial-intelligence-driven firms with little to no inconvenience.
The platform itself is very complicated. It probably can't function well without being complicated, but there is a big training curve to get over before you can effectively use it. Even I'm not sure if I'm effectively using it now.
The model DataRobot suggests for deployment is often not the best model for our purposes. We've had to do a lot of testing to determine which model is best. For regression models, DataRobot does give you a MASE score but, for some reason, often doesn't suggest the model with the best MASE score.
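For readers who want to double-check a recommendation by hand, here is a minimal sketch of ranking candidate models by MASE (mean absolute scaled error). It assumes the standard non-seasonal definition, where the model's mean absolute error on the holdout is divided by the mean absolute error of a naive "previous value" forecast on the training series. DataRobot's own MASE may use a different baseline. The function name, model names, and numbers below are hypothetical and purely illustrative.

```python
def mase(y_train, y_true, y_pred):
    """Non-seasonal MASE: holdout MAE scaled by the naive-forecast MAE on the training series."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_train) < 2:
        raise ValueError("y_train needs at least two points for the naive baseline")
    # MAE of the naive forecast that predicts each training value from the one before it.
    naive_mae = sum(abs(b - a) for a, b in zip(y_train, y_train[1:])) / (len(y_train) - 1)
    if naive_mae == 0:
        raise ValueError("training series is constant, so MASE is undefined")
    model_mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
    return model_mae / naive_mae


# Toy data (made up for illustration only).
y_train = [10, 12, 11, 13, 12]
y_true = [14, 13, 15]
candidates = {
    "model_a": [13, 13, 16],
    "model_b": [14, 14, 15],
    "model_c": [11, 12, 12],
}

# Lower MASE is better; a value below 1 beats the naive baseline.
scores = {name: mase(y_train, y_true, preds) for name, preds in candidates.items()}
best_name = min(scores, key=scores.get)
print(scores)
print("lowest MASE:", best_name)
```

Comparing the lowest-MASE model against the platform's suggestion is a quick sanity check. A recommendation may legitimately differ if it is based on another metric or a different validation partition.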
The software will give you errors if output files are not entered correctly but will not exactly tell you how to fix them. Perhaps that is complicated, but being able to download a template with your data for an output file in the correct format would be nice.
DataRobot presents a machine-learning platform designed by data scientists from an array of backgrounds to construct and develop precise predictive models in a fraction of the time previously taken. The technology involved addresses the critical shortage of data scientists by changing the speed and economics of predictive analytics. DataRobot uses parallel processing to evaluate models in R, Python, Spark MLlib, H2O and other open source libraries. It searches possible combinations of algorithms, features, transformations, processing steps and tuning to yield the best models for the dataset and predictive goal.
Cloudera Data Science Workbench has excellent online resources, such as documentation and examples. The enterprise license also comes with an SLA on tickets opened with Cloudera Services, plus support for complaint handling and troubleshooting by email or phone. It also offers additional paid training services.
As I am writing this report, I am working with DataRobot engineers in a complex environment, and we have their full support. We are in Mexico, and it is not common to get this commitment from companies without expensive service contracts. The installation is on-premise, the client does not want us to take control, and the client is also limited by internal IT regulations, so we are just doing magic and everybody is committed.
Both tools have similar features and are pretty easy to install, deploy, and use. Depending on your existing platform (Cloudera vs. Azure), you need to pick the matching Workbench. Another observation is that Cloudera has better support, where you can get feedback on your questions pretty fast (unlike MS). As it's a new product, I expect MS to become more efficient in handling customers' questions.
I've done machine learning in Python before; however, having to code and test each model individually was very time-consuming and required a lot of expertise. The DataRobot approach is an excellent way of getting to a well-placed starting point. You can then pick up the model from there and fine-tune it further if you need to.