Azure ML Studio supports many data type formats which are given in dropdown and auto selected based on the uploaded file format. Mostly you can see each type has two selections, one with header and one without header, to specify if weather data has header row or not. The most-used formats are CSV, TSV and zip files. Along with these, you can see plain.txt and R Objects data types.
Once it is loaded, data will be available in Dataset in left navigation menu. There you can review all uploaded datasets for past experiments and ready to use for future experiments.
To utilize a dataset, next you have to create a blank experiment. Again you have to click “NEW +” toolbar button and select blank experiment. On new experiment canvas you can see your uploaded dataset and sample data sets all together under Saved Data like a tree view, example is given in image.
In Azure ML, an experiment is very similar to flowcharts in which you can easily understand data flow along with nodes.
Here we can see some canvas features which are useful when you are doing a big experiment. See the image for a by-the-numbers description.
- User based zoom ratio input bar.
- 1:1 is used to auto zoom in to actual size.
- Zoom to Fit- use to zoom selected node to fit in screen.
Now we are ready to start our experiment, you can drag and drop dataset component on canvas. You can see a node is added on canvas with a link point. In our case we have only one direction entry point as it is a starting point of the experiment. These node points are used to link data flow with other nodes or visualized datasets. To visualize dataset right click on node and select “Visualize” as per the example image.
Now we can see how to convert data sets in different formats. For this you can use the Convert Dataset module. Drag and drop module on canvas and link both modules for data conversion. There are many conversion formats available and those are Convert to ARFF, Convert To CSV, Convert To dataset, Convert to SVMLight, Convert to TSV. As per your need you can select data conversion component and convert it. See the example image.
I hope you understood all covered points clearly and you can see the next portion in other blogs. Keep exploring. Happy programming!!