2023-11-10

Notice

Recent Posts

Recent Comments

Link

« 2025/06 »
일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Tags more

Archives

Today

Total

관리 메뉴

ㅤ

2023-11-10 본문

카테고리 없음

2023-11-10

20230919 2023. 11. 10. 17:09

머신러닝을 사용하여 예측 정확도를 구해보자

from sklearn.datasets import load_iris

from sklearn.neighbors import KNeighborsClassifier

from sklearn.metrics import accuracy_score

from sklearn.model_selection import train_test_split

iris = load_iris()

dt_clf = KNeighborsClassifier( )

# 검증 세트를 전체의 30%로, 학습 데이터를 전체의 70%로 분리

# random_state는 데이터세트를 임의로 나눌 때의 난수값으로 변경 가능

X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target,

test_size=0.2, random_state=42) #80% 학습용 20%검증용 랜덤 인자는 빼고 실험해보겠다

#random_state 는 아무 숫자나 넣어도 상관없다. #random state 랑 test size 의 수치 에따라서 결과값달라짐

dt_clf.fit(X_train, y_train) #모델학습

pred = dt_clf.predict(X_test) #학습한 데이터로 테스트

print('예측 정확도: {0:.2f}%'.format(accuracy_score(y_test,pred)))

iris 붓꽃으로 예측하는 코드이다.

그냥 알려주기만하면 그대로 받아들이기만하기때문에 난수값을 추가한다

예측 정확도: 1.00%

표를 사용해서 만들어보자

캐글로가서 csv 를 가지고오자

csv 파일에서 이름도 맞춰서 코드에다가 집어넣어줘야한다.

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

data = pd.read_csv('/content/Iris.csv')

data.plot(kind = "scatter", x = "SepalLengthCm", y = "SepalWidthCm" ,color=(0.5,0.8,0.2) ,label='point')

-----------------------------------------------------------------------------------------------------------------------------------------------------------------

https://www.kaggle.com/datasets/uciml/iris/data

Iris Species

Classify iris plants into three species in this classic dataset

www.kaggle.com

https://www.kaggle.com/code/benhamner/python-data-visualizations

Python Data Visualizations

Explore and run machine learning code with Kaggle Notebooks | Using data from Iris Species

ㅤ

2023-11-10 본문

2023-11-10

티스토리툴바