使用量子计算机的距离估计

问题描述

我做了一个小型基准测试，将算法的量子版本与其经典版本进行比较，我发现量子计算与经典版本相比花费了太多时间。

我不明白为什么会发生这种情况，它应该与经典相似或更好。

DataSet 说明：1 个测试数据点和 3 个训练数据点，维度 = 2。目标：我们的目标是将测试数据点归入训练数据点类别之一。

import matplotlib.pyplot as plt
import pandas as pd
from numpy import pi
from qiskit import Aer,execute
from qiskit import QuantumCircuit
from qiskit import QuantumRegister,ClassicalRegister
from qiskit import IBMQ
import os
import time

# IBMQ Configure
# IBMQ.save_account(os.environ.get('IBM'))
# IBMQ.load_account()
# provider = IBMQ.get_provider('ibm-q')
# qcomp = provider.get_backend('ibmq_16_melbourne')
##

fig,ax = plt.subplots()
ax.set(xlabel='Data Feature 1',ylabel='Data Feature 2')

# Get the data from the .csv file
data = pd.read_csv('data.csv',usecols=['Feature 1','Feature 2','Class'])

# Create binary variables to filter data
isGreen = data['Class'] == 'Green'
isBlue = data['Class'] == 'Blue'
isBlack = data['Class'] == 'Black'

# Filter data
greenData = data[isGreen].drop(['Class'],axis=1)
blueData = data[isBlue].drop(['Class'],axis=1)
blackData = data[isBlack].drop(['Class'],axis=1)

# This is the point we need to classify
y_p = 0.141
x_p = -0.161

# Finding the x-coords of the centroids
xgc = sum(greenData['Feature 1']) / len(greenData['Feature 1'])
xbc = sum(blueData['Feature 1']) / len(blueData['Feature 1'])
xkc = sum(blackData['Feature 1']) / len(blackData['Feature 1'])

# Finding the y-coords of the centroids
ygc = sum(greenData['Feature 2']) / len(greenData['Feature 2'])
ybc = sum(blueData['Feature 2']) / len(blueData['Feature 2'])
ykc = sum(blackData['Feature 2']) / len(blackData['Feature 2'])

# Plotting the centroids
plt.plot(xgc,ygc,'gx')
plt.plot(xbc,ybc,'bx')
plt.plot(xkc,ykc,'kx')

# Plotting the new data point
plt.plot(x_p,y_p,'ro')

# Setting the axis ranges
plt.axis([-1,1,-1,1])

plt.show()

# Calculating theta and phi values
phi_list = [((x + 1) * pi / 2) for x in [x_p,xgc,xbc,xkc]]
theta_list = [((x + 1) * pi / 2) for x in [y_p,ykc]]

#----- quantum start time -------#
st = time.time()
# Create a 2 qubit QuantumRegister - two for the vectors,and
# one for the ancillary qubit
qreg = QuantumRegister(3)

# Create a one bit ClassicalRegister to hold the result
# of the measurements
creg = ClassicalRegister(1)

qc = QuantumCircuit(qreg,creg,name='qc')

# Get backend using the Aer provider
backend = Aer.get_backend('qasm_simulator')

# Create list to hold the results
results_list = []

# Estimating distances from the new point to the centroids
for i in range(1,4):
    # Apply a Hadamard to the ancillary
    qc.h(qreg[2])

    # Encode new point and centroid
    qc.u(theta_list[0],phi_list[0],qreg[0])
    qc.u(theta_list[i],phi_list[i],qreg[1])

    # Perform controlled swap
    qc.cswap(qreg[2],qreg[0],qreg[1])
    # Apply second Hadamard to ancillary
    qc.h(qreg[2])

    # Measure ancillary
    qc.measure(qreg[2],creg[0])

    # run on quantum computer
    # job = execute(qc,backend=qcomp,shots=1024)
    # job_monitor(job)

    # Reset qubits
    qc.reset(qreg)

    # Register and execute job
    job = execute(qc,backend=backend,shots=1024)
    result = job.result().get_counts(qc)
    results_list.append(result['1'])

et = time.time()
# --------- end time ----------

print(results_list)
print('final circuit fig')
print(qc.draw())

# Create a list to hold the possible classes
class_list = ['Green','Blue','Black']

# Find out which class the new data point belongs to 
# according to our distance estimation algorithm
quantum_p_class = class_list[results_list.index(min(results_list))]

# Find out which class the new data point belongs to 
# according to classical euclidean distance calculation

# classical start time
cst = time.time()
distances_list = [((x_p - i[0]) ** 2 + (y_p - i[1]) ** 2) ** 0.5 for i in [(xgc,ygc),(xbc,ybc),(xkc,ykc)]]
cet = time.time()

classical_p_class = class_list[distances_list.index(min(distances_list))]


# Print time taken
print("classical time => ",cet-cst)
print("quantum time => ",et-st)

# Print results
print("""According to our distance algorithm,the new data point belongs to the""",quantum_p_class,'class.\n')
print('Euclidean distances: ',distances_list,'\n')
print("""According to euclidean distance calculations,classical_p_class,'class.')

输出：

classical time =>  **1.0967254638671875e-05**
quantum time =>  **0.2530648708343506**  // more time
According to our distance algorithm,the new data point belongs to the Blue class.

Euclidean distances:  [0.520285324797846,0.4905204028376393,0.7014755294377704] 

According to euclidean distance calculations,the new data point belongs to the Blue class.

我无法理解，为什么量子计算需要这么多时间。

解决方法

我是一名物理学家和程序员，曾在 Qiskit 上进行过广泛的工作。我在机器学习等方面的经验有限，但如果我没记错的话，figure 13 on page 22 of this paper on Nearest-Neighbor methods 正是您正在创建的电路。

因为您使用经典算法模拟量子硬件，所以性能受到了显着影响。这是注释掉的：

# IBMQ Configure
# IBMQ.save_account(os.environ.get('IBM'))
# IBMQ.load_account()
# provider = IBMQ.get_provider('ibm-q')
# qcomp = provider.get_backend('ibmq_16_melbourne')

其中“ibmq_16_melbourne”是指具有 the ibm architecture which is partially documented here 的物理量子计算机。这完全有道理，因为 IBM 限制了大多数帐户的访问。这就是为什么后来你有这个：

# Get backend using the Aer provider
backend = Aer.get_backend('qasm_simulator')

“Aer”是指在您的客户端计算机本地运行的量子计算机模拟软件。据我所知，qiskit 中还没有可以模拟特定物理量子计算机的东西。这大概会告诉您模拟/理论加速比是多少（尽管在经典计算机上模拟需要更长的时间）。

重要提示：许多定义为 Qiskit 生态系统一部分的标准（如 OpenQASM 格式）旨在与硬件无关。您可以描述任意两个量子位在任何时间相互作用的电路。但事实是，任何规模的物理量子计算机（就 10 多个量子位而言）都不会具有任何量子位到任何其他量子位的直接连接。您必须以特定于该架构的方式进行交换（例如墨尔本 16 量子位架构）。

euclidean-distance machine-learning python quantum-computing