Java代码执行时间问题

2024-05-23 21:42:05 发布

您现在位置:Python中文网/ 问答频道 /正文

在我的网络扩散研究中,我有下面的代码为顶点建模一个轻量级框架。最初的原型来自python框架,我将其翻译成Java。我遇到的问题是,虽然这段代码比它的python版本运行得快得多,最多可以运行10000个顶点,但是对于更多的顶点(100000+),它会停止运行。实际上,python版本在1.2分钟内执行,而java构建即使在执行7分钟后也没有返回。我不知道为什么相同的代码会在更多的顶点崩溃,我需要帮助修复代码。你知道吗

import java.util.*;

public class Vertex
{
  private int id;
  private HashMap<Integer, Double> connectedTo;
  private int status;

  public Vertex(int key)
  {
    this.id = key;
    this.connectedTo = new HashMap<Integer, Double>();
    this.status = 0;
  }

  public void addNeighbour(int nbr, double weight)
  {
    this.connectedTo.put(nbr, weight);
  }

  public int getId()
  {
    return this.id;
  }

  public double getWeight(int nbr)
  {
    return this.connectedTo.get(nbr);
  }

  public int getStatus()
  {
    return this.status;
  }

  public Set<Integer> getConnections()
  {
    return this.connectedTo.keySet();
  }

//testing the class

  public static void main(String[] args)
  {
    int noOfVertices = 100000;

    Vertex[] vertexList = new Vertex[noOfVertices];

    for (int i = 0; i < noOfVertices; i++) {
        vertexList[i] = new Vertex(i);
    }

    for (Vertex v : vertexList) {
        int degree = (int)(500*Math.random()); //random choice of degree 
        int neighbourCount = 0; // count number of neighbours built up

        while (neighbourCount <= degree) {
            int nbr = (int) (noOfVertices * Math.random()); // randomly choose a neighbour
            double weight = Math.random(); // randomly assign a weight for the relationship
            v.addNeighbour(nbr, weight);
            neighbourCount++;
        }
    }

  }
}

作为参考,此代码的python版本如下:

import random

class Vertex:
    def __init__(self, key):
      self.id = key
      self.connectedTo = {}

    def addNeighbor(self, nbr, weight=0):
      self.connectedTo[nbr] = weight

    def __str__(self):
      return str(self.id) + ' connectedTo: ' \
          + str([x.id for x in self.connectedTo])

    def getConnections(self):
      return self.connectedTo.keys()

    def getId(self):
      return self.id

    def getWeight(self, nbr):
      return self.connectedTo[nbr]

if __name__ == '__main__':
  numberOfVertices = 100000
  vertexList = [Vertex(i) for i in range(numberOfVertices)] # list of vertices

  for vertex in vertexList:
    degree = 500*random.random() 
    # build up neighbors one by one
    neighbourCount = 0 

    while neighbourCount <= degree:
        neighbour = random.choice(range(numberOfVertices))
        weight = random.random() # random choice of weight
        vertex.addNeighbor(neighbour, weight)
        neighbourCount = neighbourCount + 1

Tags: 代码selfidforreturndefrandompublic
1条回答
网友
1楼 · 发布于 2024-05-23 21:42:05

这是一个非常有趣的问题,我相信我也学到了一些新的东西。我试着用不同的方式优化代码,比如使用并行流,以及使用ThreadLocalRandom,这比Random快三倍。然而,我最终发现了主要的瓶颈:分配给JVM的内存。你知道吗

因为有太多元素被添加到Map(最坏的情况是500000个顶点,100000个顶点),所以需要大量内存(堆空间)。如果您允许JVM动态分配内存,那么程序将需要很长时间才能执行。我解决这个问题的方法是通过将-Xms3G作为VM参数应用到程序的运行配置(可以在IDE中或通过终端完成)来预分配内存到JVM(特别是3gb)。你知道吗

我还对您的代码进行了一些优化,我将在下面发布这些代码(我只需几秒钟即可完成):

import java.util.*;
import java.util.concurrent.*;
import java.util.stream.*;

public class Test {

    private static final ThreadLocalRandom RANDOM = ThreadLocalRandom.current();

    public static void main(String[] args) {
        int noOfVertices = 100_000;

        Vertex[] vertexList = new Vertex[noOfVertices];

        IntStream.range(0, noOfVertices).parallel().forEachOrdered(i -> {
            vertexList[i] = new Vertex(i);

            int degree = (int) (500 * RANDOM.nextDouble()); // random choice of degree

            for (int j = 0; j <= degree; j++) {
                int nbr = (int) (noOfVertices * RANDOM.nextDouble()); // randomly choose a neighbor

                vertexList[i].addNeighbour(nbr, RANDOM.nextDouble());
            }
        });
    }

}

class Vertex {

    private int id;

    private Map<Integer, Double> connectedTo;

    private int status;

    public Vertex(int id) {
        this.id = id;

        this.connectedTo = new HashMap<>(500);
    }

    public void addNeighbour(int nbr, double weight) {
        this.connectedTo.put(nbr, weight);
    }

    public int getId() {
        return this.id;
    }

    public double getWeight(int nbr) {
        return this.connectedTo.get(nbr);
    }

    public int getStatus() {
        return this.status;
    }

    public Set<Integer> getConnections() {
        return this.connectedTo.keySet();
    }

}

我不确定在多线程环境中使用ThreadLocalRandom的显式结果,但是如果您愿意,可以将其切换回Math#random。你知道吗

相关问题 更多 >