Why use python for data analysis?
1. Python’s large number of libraries provide a complete toolset for data analysis (recommended learning: Python video tutorial)
2. Compared with MATLAB, R language and other languages that are mainly used for data analysis, the python language has more complete functions
3. The number of python libraries has been increasing , the method adopted to implement the algorithm is more innovative
4. Python can be easily connected to other languages, such as c, java, etc.
2 , What is IPython?
IPython is a python interactive shell (its default python shell is much easier to use and more powerful)
1. Supports automatic code completion and automatic Indentation, already supports bash shell
2, Jupyter NoteBook (formerly known as IPython NoteBook), which provides an interface for users to interact with the IPython kernel, and it is also an interactive notebook (can be saved Your source code, running results), a python web interface that integrates text (markdown), code, images, and formulas
3. Supports interactive data visualization and other graphical user interfaces
4. Support high-performance parallel computing
3. Running environment
There are many program libraries for data analysis and machine learning. These program libraries (such as : numpy, pandas, sckilearn, TensorFlow, etc.), it would be troublesome to configure and install it alone, and some packages (such as scipy) rely on many libraries; the official provides an integrated data analysis and machine learning development tool , that is, anaconda installation: download the latest version from the official website, just install it under windows
Open:
Method 1, use the command
Use the cmd command line or Linux terminal to embed the command: jupyter The notebook will run two programs: the IPython service program and the jupyter notebook web interface, and then the code can be written in the interface
Note] The IPython server is where the program runs, and jupyter notebook only provides An interactive interface, if you turn off the IPython service program (ctrl c in the terminal) jupyter notebook will be useless.
Several basic operations:
Double-click D: delete the current cell
Click M: Convert the current cell into a markdown document
Jupyter structure: It is composed of cells. The execution of each cell does not affect each other, but the data is shared
Method 2, open with anaconda interface
Method 3, open with pycharm
[Note] The compiler must select the python compiler in the anaconda directory, otherwise IPython cannot be opened Service Program
For more Python-related technical articles, please visit the Python Tutorial column to learn!
The above is the detailed content of Why choose python for data analysis. For more information, please follow other related articles on the PHP Chinese website!